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ABSTRACT 

The primary purpose of this study was to determine if 
a model for identifying gifted and talented students could be 
developed which would provide equal access to gifted programs for 
children of all ethnic and economic backgrounds. The culturally and 
ethnically diverse San Diego City School District provided a pool of 
over 35,000 children referred for giftedness whose records were coded 
and analyzed through this research. Based on these findings, a model 
designed to increase the proportion of ethnically and economically 
diverse students referred for assessment and identified as gifted was 
implemented and evaluated, with the Raven Progressive Matrices used 
as the criterion measure of intellectual ability. Component research 
papers by Dennis P. Saccuzzo, Nancy E. Johnson and Tracey L. Guertin 
cover the following topics: the use of the Raven Matrices in an 
ethnically diverse gifted population; use of the Wechsler 
Intelligence Scale for Children - Revised (WISC-R) with disadvantaged 
gifted children; evaluation of risk factors in selecting children for 
gifted programs; information processing in gifted versus nongifted 
African-American, Latino, Filipino, and White children; ethnic and 
gender differences in locus of control in at risk gifted and 
nongifted children; and understanding gifted underachi evers in an 
ethnically diverse population. Appendices include a teacher 
nomination form, a student/parent information form, and an 
independent evaluation review, in which i^uthor Margie Kitano finds 
the new model to have significantly impacted school system practice 
and increased the number and proportion of underrepresented students 
referred and identified although failing to fully meet the initial 
criterion for equal access. (Contains 222 references.) (Author/PB) 
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PREFACE 



The present two volume set represents some of the major findings of a three year Jacob Javits 
grant (#R206A00569) funded by the U. S. Department of Education entitled: Identifying 
Underrepresented Disadvantaged Gifted and Talented Children: A Multif aceted Approach. Volume 1 
presents an overview of the project and includes an invited article for Gifted Child Quarterly by Margie 
Katino of San Diego State University (see Appendix HI to Volume 1). Professor Kitano's article provides 
an independent evaluation of the results of the grant. 

Volume 2 is divided into 6 chapters. These chapters provide the technical basis for the conclusions 
reached in Volume 1 and describe in detail the secondary aspects of the grant. 

This two volume set represents our best efforts to summarize and disseminate our findings 
within the severe time constraints of the termination of our funding as of 12/31 / 93. While fhe information 
provided is of considerable relevance to the attainment of equal access to gifted and talented programs, 
it is important to emphasis that we have far more data than it was possible to present in analyzed form 
by the deadlines under which we operated. We plan to continue to refine the various manuscripts that 
comprise Volume 2 and to add additional manuscripts to augment and support the findings presented 
herein. 

Tne authors are indebted to numerous individuals. We would like to thank our technical momtor, 
Norma Lindsay for her strong support. We are also indebted to the San Diego Unified School System, 
to gifted and talented program administrator, David Hermanson, and to the School Psychologists who 
administered the thousands of tests that provided the basis for our analyses. We would also like to 
thank the more than 30 special shidies students who enrolled for independent study tinder Professors 
Johnson and Saccuzzo. These excellent students aided in the data collection and data input, and in 
return received invaluable hands-on-experience in dealing with real-world research problems. We would 
also like to thank Arlysse KierUe for her role in typing several versions of the various manuscripts, and 
to Susan McLaughlin for her role in the data collection and shidy of locus of control. Helen Veinbergs 
and Chris Bemet also played a significant role in the early stages of the data collection process. In 
addition, numerous San Diego State and University of California, San Diego shidents participated in 
professional conventions by presenting preliminary findings based on data collected for this grant. 

Nancy E. Johnson's predoctoral studies were funded, in part, by funds from this grant. Her 
doctoral dissertation is included in its entirety in Volume 2. In addition, this grant funded a year of full 
time postdoctoral shidy for Dr. Johnson. Dr. Johnson participated in aU phases of this work and conducted 
the lion's share of the data analyses. 

Although many people played a critical role in this project, the authors are solely responsible 
for its contents. We welcome comments and criticisms, and will do our best to incorporate feedback 
into any final published manuscript. We can only hope that in some small way the present findings 
contribute to our tinderstanding of identifying traditionally tmderrepresented gifted children and 
achieving equal access in selecting children from all walks of life for gifted programs. 

Dennis P. Saccuzzo 
Nancy E. Johnson 
Tracey L. Guertin 

12/17/93 
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IDENTIFYING UNDERREPRESENTED DISADVANTAGED GIFTED AND TALENTED 
CHILDREN: A MULTIFACETED APPROACH 



Grant #R206A00569, U.S. Department of Education, Jacob Javits Gifted and Talented 

Discretionary Grant Program 



Students from diverse social, cultural, linguistic, and economic backgrounds are consistently 
tinder identified for Gifted and Talented Education (GATE) programs in every major city in the nation. 
More specifically, there are systematic discrepancies between the percentages of ethnic minority students 
in GATE programs and their percentages in their respective school districts. The problem of the under 
identification of ethnic minorities in GATE programs is broad and persistent. As Richert (1985, 1987) 
and Colleagues (Richert Alvino, & McDonnel, 1982) have repeatedly noted, figures reported by the 
U.S. Department of Education's Office of Civil Rights reveal that groups such as African- American and 
Latino/ Hispanic are imder represented by as much as 70% in gifted programs throughout the nation. A 
study in California found discrepancies between the percentage of ethnic minorities in GATE programs 
and the percentages in their respective school districts in each of the 193 school districts that were 
evaluated over a three year period (Sunset Review Advisory Committee UI Report, 1986). 

Equal access means that children from diverse ethnic and economic groups are evaluated and 
selected for GATE programs in proportion to their numbers in the district as a whole. The primary 
purpose of "Identifying Underrepresented Disadvantaged Gifted and Talented Children: AMultifaceted 
Approach", a grant supported by the Jacob Javits Gifted and Talented Discretionary Grant Program of 
the U.S. Department of Education, was to determine if a model of selection could be developed that 
would provide equal access to gifted programs for children of all ethnic and economic backgrounds. 
The San Diego City School District, which is among the most diverse in the country, provided an excellent 
site for determining if the lofty goal of equal access could be achieved on a large scale basis. The district 
has over 123,000 children, with approximately 29% Latino/Hispanic, 38% Caucasian, 16% African- 
American, and the remainder in significant numbers across five additioncd ethnic backgrounds (see 
Figure 1). 

Figure 1. 1992/1993 district ethnic composition: San Diego Unified School District 
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There were a number of limiting factors in the present effort to develops equal access model 
of selection. First and foremost, IT WAS NECESSARY TO APPLY A CONSISTENT APPROACH TO 
SELECTION ACROSS ALL ETHNIC BACKGROUNDS. The present project was not simply to 
determine if more underrepresented children could be identified for GATE programs, given some special 
or new procedure appUed specifically to that group. It was an attempt to determine if a single standard 
could be applied to the entire population and produce an equal access result. Special ethnic norms 
could not be used, and were unacceptable to all elements of the community. If something special was 
being done for one group, it had to be applied to all. 

A second limiting factor was that THE METHODS OF IDENTIFICATION WOULD BE 
OBJECTIVE AND RELIABLE. This limitation ruled out subjective rating systems and other approaches 
whose results could not be rigorously repeated. 

The third limiting factor was that THE SELECTION PROCESS WOULD HAVE TO BE 
PRACTICAL AND COST EFFECTIVE. Given a school system of over 123,000 children, as many as 
5-10,000 ciiUdren had to be evaluated yearly A procedure was needed that could successfully accomplish 
this evaluation process without the need for a small army of trained professionals, which was noi 
available. 

A fourth limiting factor was that the gifted program was an ACADEMIC PROGRAM, based 
on high intellechjal ability or high achievement. There were no programs available for other types of 
giftedness, such as exceptional artistic or musical ability. 

The final limiting factor was that THE PRESENT STUDY WAS INITIATED WITHIN A 
SCHOOL DISTRICT THAT ALREADY HAD A LARGE, THRIVING GIFTED PROGRAM DATING 
BACK TO THE 1950'S, WITH AN ESTABLISHED SYSTEM OF IDENTIFICATION ALREADY IN 
PLACE AND AN ADMINISTRATIVE UNIT TO IMPLEMENT THIS SYSTEM. 

Within these limitations, the present shidy provided one of the most extensive, if not the most 
extensive, shidy of the efficacy of standard psychometric tests in providing a uniform standard of 
identification for giftedness across ethnic background. When the proposal was first funded in October 
of 1990, the traditional identification procedures used by the school system, which rslied heavily on 
standardized group and individual achievement and intelligence tests, was weU underway. The two 
primary measures ire use were the Developing Cognitive Abilities Test (DCAT), a group test of verbal, 
quantitative, and spatial aptihide, and the Wechsler Intelligence Scale for Children-Revised (WISC-R), 
the most widely used individual test of intelligence for chUdren. The process of implementing the 
present project thus began with an evaluation of these two tests and the identification procedures that 
were in use. At the same time, the efficacy of the Raven Progressive Matrices Test, a culhirally reduced 
measure of general intelligence, was evaluated. Prior to reporting on the results of these evaluatioi^, 
the model that was ultimately recommended, and the many practical problems that were encountered, 
it is importaxit to note the secondary goals of the present project. 

Secondary goals for the present project were as foUows: (1) identify and select greater numbers 
and an increased proportion of underrepresented disadvantaged children; (2) test the efficacy of a battery 
of nontraditional micro-computerized information-processing tasks and determine, if any, the umque 
information-processing strengths of underrepresented chUdren; (3) evaluate the efficacy of measures of 
locus of control in selecting underrepresented children for gifted programs; and (4) utilize archival data 
to test hypotheses about ethnic differences in the pattern of inteUechial strengths. In meeting the goab 
of tiie study, the records of more than 35,000 children referred for giftedness were coded and analyzed. 

The report will begin with an analysis of the primary objective: To develop an objective, reUable 
method of evaluating large numbers of children for giftedness that wUl result in equal access. Each of 
the four secondary goals will be addressed in Volim\e 2. 
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1. The Goal of Proportional Representation in 8 Gi fted Pr ogram. 

During the early months of the study, we monitored tlic -election method that already had been 
in progress. The method in use had evolved over several years, a^d had, in fact, i^ulted in significant 
increases in the percentage of nonwhites being selected for the gifteO, programs (see Figure 2). As Figure 
2 shows, the population of nonwhites selected for the gifted progts had increased substantially, in 
terms of both proportion and numbers. 

Figure 1 White and Nonwhite GATE certified students, 1982/83 - 1989/90 
TDOOr 




A, Identification Procedures in Use Prior to the Study 

The process in use was as follows. Children were nominated for evaluation of giftedness by 
teachers, principals, parents, or self. In addition, central nominations were made at the GATE office of 
children who had obtained at least one score in the 8th or 9th Stanine on the Califorrua Test of Basic 
Skills (CTBS), a standardized group achievement test that was given to each child in the district at 
regular intervals. The purpose of central nominations was to increase referrals for potentially qualified 
African-American and Latino/Hispanic children, as these two groups were historically imdenepresented 
in the nomination process. Next, all nominated children were given the Developing Cognitive Abilities 
Test PCAT). The DCAT was group administered at the school site. All children who obtained a score 
at the 90th percentile or above on the DCAT were then referred for individual testing by one of seven 
(later reduced to five) school psychologists. The psychologists had a choice of administering a Wechsler 
Intelligence Scale for Children-Revised (WISC-R), Kaufman Assessment Battery for Children (K-ABC), 
or Stanford-Binet Intelligence Scale, Fourth Edition. For all but a very small percentage of children, the 
WISC-R was used. 

To qualify for the gifted program based on intellectual achievement, a child had to obtain a 
score of 130 or greater (i.e., at least 2 standard deviations above the mean) on the WISC-R. In addition, 
the selection model took potential risk factors into accoimt. Sbc risk factors were considered: (1) economic 
disadvantage (e.g., poverty); (2) cultural differences (e.g., limited experience with the dominant culture); 
(3) language (e.g., primary language of parent or student is other than English); (4) environmental 
disadvantage (e.g., high crime area, overcrowding, noise); (5) social/emotional (e.g., separation, divorce, 
or death of a parent; adjustment problems); and (6) health ( e.g., asthma, childhood cancer, etc.). More 
recently, the district combined cultural and language risk into a single category. 

Risk factors were evaluated by the use of self-report questionnaires (see Appendix I and H) sent 
to teachers and parents. The data were then evaluated by a school psychologist. If it was determined 
that a child had two or more risk factors, and the child had a Full Scale WISC-R IQ score less than 130 
but greater than or equal to 120, the child was admitted into the gifted program on the basis of high 
potential The consideration of risk factors was applied equaUy across ethnic backgrounds, and so was 
consistent with the single standard requirement in the selection process. 
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While the selection model described above had resulted in a steady increase in the proportion 

of nonwhites selected for the gifted program, careful monitoring within the context of the present study 

revealed that two groups, African- American and Latino /Hispanic, v^ere consistently underidentified* 

Figure 3 shows the percentage of children selected for the gifted program as a function of etiinic 

backgroimd using the risk factor system and the WISC-R as the final criterion measure. 

Figure 3. Et>inic composition of those certified gifted by San Diego Unified School District 
on the basis of WlSC-R scores 
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As the figure shows, the WISC-R overselected Caucasians at about 100% greater than their 
numbers and underselected African-Americans and Latino/Hispanics at a rate of 2 to 4 times less than 
would be expected based on their numbers. In Volume 2, we present an in depth analysis of the WISC- 
R in selecting for giftedness. Our results unequivocally revealed that there is no model for using the 
WISC-R that can result in an equal access program, short of ethnic norms. Since no such norms exist, 
our data unequivocally demonstrated that the WISC-R will always result in a biased selection in favor 
of Caucasian and against African-American and Latino/Hispanic children. Our data revealed that the 
WISC-R and other highly correlated standardized intelligence tests cannot be used in an unbiased manner 
as a standard in selecting diverse groups of children for giftedness, 

B. Monitoring The System to Obtain Equal Representation in the Evaluation Process. 

In monitoring the system in use when the present project was first funded, a major problem 
was uncovered: under representation in the nomination process. African-American and Latino/Hispanic 
children, the two imderrepresented groups, were not being nominated for evaluations in proportion to 
their numbers in the district as a whole. This imder nomination of African- American and Latino/^ 
Hispanic children represented a major obstacle to the development of an equal access program. 

In dealing with each of the obstacles to an equal access program, our approach was to analyze 
the situation and get to the root of the problem. We discovered that in the more affluent, predominantly 
white schools, the gifted program was seen as providing superior educational opportumties for children. 
Consequently, white children were referred by parents, or by teachers often with pressure from parents, 
in far greater nimibers than would be expected based on the proportion of whites in the district as a 
whole. To make matters worse, there seemed to be a general lack of interest in the gifted program iii 
many of the less affluent schools with high concentrations of Latino/Hispanic and African-American 
children. 

One solution to the under nomination problem would have been to screen all children for the 
gifted program, such as all third graders. In fact, this is exactly what we proposed. Based on preliminary 
positive findings with the Raven Progressive Matrices Test, we proposed that all third grade children be 
given ti\e Raven Progressive Matrices Test as an initial estimate of cognitive abilities. This proposal was 



rejected by school administrators on the grounds that it would reintroduce comprehensive intelligence 
testing in the school district. Despite assurances of confidentiality, some educators in the system feared 
that test scores would somehow get back to teachers and create nega' i^ e self-fulfilling prophecies in 
low IQ-scoring children* 

We subsequently turned to a monitoring system, which proved to be quite effective. Each 
month we gathered and analyzed the data pertaining to the proportion of children tested as a function 
of ethnic background. We then created bar graphs that compared the proportion of children tested to 
their proportion in the district for each ethnic background (see Figures 4 and 5). 

Figure 4. Percent tested versus percent in the district 3/1/91 Rgure 5. Percent tested versus percent in the district 4/10/91 




Figures 4 and 5 illustrate this approach. For each ethnic backgroimd there are two bar graphs. The first 
shows the percentage of children from that ethnic background who were nominated and tested for 
giftedness. The second shows the actual percentage of children from that ethnic background in the 
district as a whole. These figures provided clear, graphic data that illustrated that African- Americem 
and Latino/Hispanic children were underrepresented in the nomination process, and that white children 
were overrepresented. The monitoring procedure led to an awareness of the problem within the school 
district that ultimately led to change. 

A major effort was made to increase referrals for qualified African-American and Latino/ 
Hispcmic children. Training sessions were held for teachers. The teachers v/ere made aware that the 
two underrepresented groups were not being referred in proportion to what would be expected based 
on a model that assumes giftedness is evenly distributed in the population across ethnic backgrounds. 
They were given instruction in cttltural differences and participated in discussions on how to spot 
potential giftedness in African-American and Latino/Hispanic children. In addition, a central nomination 
procedure was used at the GATE Administrative Office in which all hdgh achieving African- American 
and Latino /Hispanic children, as evidenced by scores above the 50th percentile on standardized 
achievement tests, were included in the screening process. Finally, the GATE psychologists took an 
active role in soliciting nonunations from individual schools. For example, one psychologist reported 
going to a predominantly African- American school where not a single African-American had been 
nominated for giftedness testing. The psycliologist insisted that the school provide its top 100 African- 
Americans for evaluation. Of these, 33 scored two standard deviations above the mean on the Raven 
Progressive Matrices Test! 

Figures 6, 7, and 8 illustrate the success of the monitoring in terms of equal access in the 
evaluation process. The figures show the proportion of Latino/Hispaiuc, African-American, and 
Caucasian children evaluated, respectively, for three time periods: 9/89-6/90 (a baseline measure that 
represents the proportion tested prior to the monitoring process), 9/90-6/91 (the first year of the project), 

er|c ^ ^ 



and 9 /91-6/93. For each bar graph, proportional representation is determined by dividing the proportion 
of children evaluated for any given time period by *eir proportion in the district as a whole. 



As inspection of Figure 6 indicates, the 
proportion of Latino/Hispanic children 
nominated for evaluation of giftedness increased 
steadily throughout the three periods, from just 
over 0.6 prior to the monitoring process to 
approximately 0.8 by the end of the first year to 
nearly 1.0 as of this writing. For the first time in 
the history of the San Diego Unified GATE 
program, Latino/Hispanic children are being 
nominated and evaluated for the gifted program 
in proportion to their numbers in the district. 



figure 6. Proport'otute reprisentabon of Utmos/Hispinics evaluated, 
as opposed to their representation in the population 
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Figure 7, Proportionate representation of African-Amenans evaluated, 
as opposed to their representation in the population 
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Inspection of Figure 7, which illustrates the 
proportion of African-American children 
referred and evaluated for each of the three 
time periods, shows that there was a steady 
increase in the proportionate representation of 
African-Americans. 



Finally, Figure 8 reveals that the proportion of 
Caucasians referred and evaluated fell from a 
substantial overrepresentation to a slight 
overrepresentation at the present tin\e. 



Figure 8. 
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Proportionate representation of Caucasians evaluated, 
as opposed to their representation in the population 
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Figure 9 provides a summary of the proportion of children nominated and evaluated for the 
gifted program for each of the three time periods through 6/93 for six ethnic backgrounds: Latino/ 
Hispanic, African- American, White, Native American, Filipino, and Indochinese. As the figure shows, 
whUe there was never a perfect representation, it was possible to ameliorate all major inequities. From 
our data and experience, we draw the following conck^sion: The attainment of equal representation 
in the proportion of children nominated and evaluated for gif tedness across ethnic background is a 
realistic and readily attainable goa? for any school district. 



Figure 9. 
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C. The Shift to the Raven Progressive Matrices Test 

As previously indicated, our monitoring of the selection process quickly revealed the 
inappropriateness of the Wechsler Intelligence Scale for Children-Revised (WISC-R) as the standard 
for giftedness in the ethnically diverse San Diego City School District (see Figure 3). Using ongoing 
records as well as archival data from approximately 15,000 cases evaluated for giftedness between 
1985 and the present we conducted a thorough analysis of tlie WISC-R DetaUed results of this analysis 
appear in Volume 2. In brief, the results revealed that there was no model (e.g., weighted subtests, use 
of different subtests for different ethnic groups, etc.) of use of the WISC-R that produced equal access. 
VS/hen we provided this information to the GATE psychologists and administrator in early 1991, the 
district modified its selection policy and added the Raven Progressive Matrices Test as one of the 
standardized tests that could be used to certify a child as gifted. This modification represented 
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a change in the original design of our shidy, which had caUed for one full year of monitoring to establish 
a baseline. Those who were in charge of the San Diego City Schools GATE program decided that since 
we knew the procedures presently in progress were doomed to fail in terms of providing equal access, 
th«»re was a need for an instant poUcy change. From a scientific standpoint, a midyear shift in selection 
procedures might reduce the rigor of our ultimate findings. From a practical standpoint, however, it 
niade no sense to continue using biased procedures. 

1. Characteristics of the Raven 

The use of the Raven Progressive Matrices (RPM) test was indeed weU founded. As Carpenter, 
Just, and Shell (1990) put it, the Raven provides a measure of "the ability to reason and solve problems 
involving new information, without relying extensively on an expUcit base of declarative knowledge 
derived from eitiier schooling or previous experience" (p. 404). TheoreticaUy the Raven provides a 
measure of "fluid intelUgence", in contrast to "crystaUized inteUigence", which reflects previously 
acquired skills (CatteU, 1963). IDespite its relative independence of previous experience and its nonverbal 
format. Raven test scores correlate highly with measures of inteUe-tual achievement (Court & Raven, 
1982), which suggests tiiat the underlying processes are general rather than specific to this one test. 

For years researchers in the United States had pointed to ihe RPM as among the most promising 
of the nontraditional approaches for assessing giftedness in ethnically diverse groups. The RPM Provides 
a nonverbal measure of inteUectual functioning that minimizes the effects of language and culhire (Baska, 
1986; James, 1984; Powers & Barkan, 1986; Carpenter et al, 1990). Until recently, however, the RPM 
could not be applied on a widescale basis due to the absence of adequate norms. 

In 1986 the manual for the RPM was upd ated along witii the pubUcation of an impressive set of 
nonns that included smoothed norms for Americans, a variety of ethnic norms, and international norms 
from major world cities (Raven and CoUeagues, 1986, 1990). Moreover, as Baska (1986) has noted, the 
RPM not only has been shown to be effective in identifying gifted minority students, it has also correlated 
well with success in the Chicago City School System. A variety of reseaid^ shidies have supported the 
use of title RPM for children from culturally diverse backgrounds (Karnes, Lee, & May 1982; Powers, 
Barkan, & Jones, 1986; Sidles & MacAvoy 1987). 

There are several form^ of the Raven, including a colored form for very young chUdren, the 
Standard 60 item test, and the advanced test for the very highest levels of abUities in adolescents and 
adults All forms consist of tiie same type of problem, tiie incomplete mabix. The child is shown a 
design with a distinct pattern that may be based on form, number, size, and a vanety of other orgamzing 
principles. A part of the pattern is omitted and the child must select from among six to eight alternatives, 
and choose the one that acoorately completes the pattern. 

According to a detailed tiieoretical and empirical analysis by Carpenter et al (1990), \he Raven 
measures a basic ability underlying inteUigence as foUows: "to decompose problems into manageab e 
segments and iterate through them, the differential ability to manage the hierarchy of goals and subgoals 
generated by titis problem decomposition, and the differential ability to form higher level abstractions 
(p. 429). 

An extensive body of research reported in tiie test manual reveals that the RPM is about as 
reUable as the WISC-R. VaUdity stiidies have revealed that the RPM measures general mtelUgence, and 
is perhaps the single best measure of Spearman's g factor (Marshalek, Lohman, & Snow, 1983; Snow, 
Kyllonen, & Marshalek, 1984). 

To digress for a moment. Spearman's g factor (Spearman, 1927) is one of the most robust findings 
in the field of testing. Its existence is based on the weU known phenomenon that aU tests of mtelligence 
and scholastic aptihide are positively correlated, witii a general range from about .60 to .90, and a rnean 
coefficientofabout.75(seeCarro' 1992; Humphreys, 1992; Jensen, 1992). men the matrix of correlations 
among diverse but correlated tests of inteUigence and aptihide are subjected to a hierarchical factor 
analysis it is ahnost always found that at least half the variance can be accounted for by a common 
factor, which Spearman (1927) caUed g and interpreted in terms of general mental energy. 
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Because the RPM measures g, it correlates with other measures of general inteUigence including 
language abiUties and reading, even though the RPM itself contains no language or reading problems. 
Moreover, because it does not involve reading, language, or other aspects of acquired or c^^^^ 
inteUigence, the Raven is a far better measure of pure potential than tests such as the WISC-R, whose 
scores depend heavily on acquired knowledge. 

In Volume 2 we present a number of important findings pertaining to the Raven that further 
support its vaUdity. For now, just one example wiU be given. From our database we evaluated the 
records of more than 2,000 children who had been given both a Raven and a WISC-R. Scores on the 
Raven and WISOR were then correlated with scores on the California Test of Basic Skills (CTBS), a 
Standardized Test of Achievement in Language, Reading, and Math. For both African-Amencan and 
White children, the Raven was a better predictor of language achievement than the WISC-R Verbal, 
Performance, or Full Scale IQs. Such results are probably due to the Raven's abiUty to evaluate potential 
independently of past learning. 

2. Irutial Shift to the Raven 

By about March or April of 1991, the Raven was added to the evaluation tools for every GATE 
psychologist. InitiaUy, the psychologists took a cautious approach. They tended to use the Raven for 
nominated nonwhite students only after the student failed to met the cut-off on the WISC-R 
Approximately 50% of the nonwhite children who had faUed to qualify based on a WISC-R qualified 
with the Raven. 

3. The Adoption of the Raven as the Primary Certification Tool. 

In September of 1991 the district changed its identification procedures based on the initial positive 
results with the Raven. The DCAT, the group screening test, was completely eliminated from the process. 
The nomination process, which 



had resulted in children being 
referred for giftedness testing 
almost in proportion to their 
numbers, was retained. Also 
retained was the risk factor 
system. The major change Wc.s 
that all children referred for 
giftedness were group-tested v^th 
the Raven and ultimately 
qualified, or did not qualify, based 
on their Raven score. A child with 
only one or no risk factors needed 
a Raven IQ equivalent of 130 or 
greater to qualify for the gifted 
program; a child with 2 or more 
risk factors needed an IQ of 120 
or greater to qualify. 

Figure 10 shows the 
result of the shift to the Raven 
(compared to the WISC-R), and 
the dramatic effect its use had in 
the percentage of traditionally 
underrepresented children 
enrolled in the program. 



Rguie 10. Ethnic composiHon of those certified gifted by San Diego Unified School District 
on the basis of WISC-R (1984-1990) versus Raven's Progressive Matrices {1990-1993) 
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Figures 11 47 show comparisons of the Raven versus the WISC-R for specific populations. Tliese 
figures dramaticaUy illustrate the selection bias in the WISC-R and a corresponding lack of bias in the 
Raven. 



Figure II. Achieving equity in gifted representaiioa Lstino/Htsptmc population 
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Figure 11 reveals that 26.6% of the population are 
Latino/Hispanic. Whereas only 5.6% of the 
children selected for giftedness with the WISC-R 
were Latino/Hispanic, 18.5% of those certified 
with the Raven were Latino/Hispanic. Thus, use 
of the Raven resulted in an increase of over 3307o 
selection of Latino/Hispanic children for the 
gifted program. 



Figure 12. .Achieving equiiy m gifted repreaenttQon. Rlipmo population 



Figure 12 reveals that 8% of the population 
are Filipino. Whereas 4.4% of the children selected for 
giftedness with the WlSC-R were Filipino, 9.9% of 
those selected with the Raven were Filipino. 
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Figure 13. Achieving equity In gifted reprewnuition .aucasiin population 
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In Figure 13 it can be seen that conipared to a 
district population of 39%, 75.5% of the individuals 
selected for the gifted program were Caucasian when 
the WISC-R was used. With the Raven, this 
percentage dropped to 47. 1 . 



Figure 14. Achieving equity m gifted repr«<nUtiOft— — Indochinev popuUtwo 



Figure 14 is quite dramatic. Although 
Indochinese represented 7.7% of the total population, 
only 1 .5% of those selected with the WISC-R were 
Indochinese. With the Raven, there was more than a 
500% increase and these children were selected 
approximately in proportion to their numbers in the 
district as a whole. 
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RgurelS. Achieving equity In gifted rtpresentiticxi AfnarvAmtnan population 
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Figure 15 reveals that compared to a percentage 
of 16.2 in the district, only 6.2% of the children 
selected for the gifted program were African- 
American when the WISC-R was used. This figure 
jumped to 10.4% with the Raven - a 168% increase 
for these traditionally underrepresented children. 



Figure 16 shows the familiar pattern of 
marked changes toward increased equity with tiie 
Raven. Comprising .7% of the district. Pacific 
Islanders represented only .2% of the gifted 
population selected with the WISC-R - an 
underselection of 350%. With the Raven, these 
children were selected at slightly above their 
proportion in the district. 



Figure 16. Achievii^ equity in gifted represenutioa — i>»cificlslinder popuUtioo 
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Figure 17. Achieving equity m gifted represenutioru Native American population 
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Figure 17 shows that results for Native 
American children paralleled those results 
for Pacific Islanders. From a substantial 
underrcpresentation with the WISC-R, 
these children were selected at a rate that 
was slightly above expectation when the 
Raven was used. 



Figure 18 provides a 
graph of the nonwhite enrollment 
in the GATE program. Notice the 
dramatic, nearly hyperbolic 
increase that occurred in the 
percentage of nonwhites enrolled 
in the GATE program between 
1990 and 1993. 



Figure 18. Nonwliite student enrollment in GATE, 1986 - 1993 
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Figure 19 shows the growth in numbers of Latino/ Hispanic and African- American children 
certified as gifted for each year between 1989 and 1992. Notice that the growth in Latino/Hispanic 
children enrolled in the GATE program multiplied almost 70-fold from 1989 to 1992. The African- 
Americans, likewise, saw their rate of growth increase dramatically, with approximately an 8-fold increase 
between 1989 and 1992. Jn 1989 40 African-Americans were added to the GATE program. By 1992 that 
number had grown to 350, while the proportion of African-Americans in the district remained steady 
For Latino/Hispanic chUdren, almost 700 were added to the GATE program, compared to less than 10 
for 1989. 

Figure 19. Growth in numbers of traditionally underrepresented children certified for GATE 
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D. Emergent Problems 

The dramatic increases in the number and proportion of Latino/Hisparuc and African-American 
children was not uniformly weU-received. In fact, there was a poUtical upheaval among some of the 
more affluent white parents. Recall that many white parents viewed the gifted program as a superior 
educational system for their children. Even though Caucasian children continued to be evaluated and 
certified at least in proportion to their numbers, many white parents, including some whose own child 
failed to be certified for the GATE program, complained forcefully 

Some parents wrote to the school board, while others caUed for the termination of the GATE 
administrator and the present research program. Some parents recniited psychologists and statisticians 
from the community to critique and refute the Raven. The Los Angeles Tmies did an investigation and 
wrote an article. In the end, there was a move to combine the GATE program with the special 
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education program, presumably so that there could be a return to the old order. The move to combine 
the two programs was successful, and the GATE program joined special education in an Exceptional 
Programs Department. However, there have been no changes to date in the methods of identification 
outlined herein. 

A second problem that emerged was that teachers of GATE children were not prepared to deal 
with the large influx of high potential African-American and Latino/Hispanic children who, whUe 
gifted, did not fit the mold of the more affluent children who had previously been certified as gifted 
with the WISC-R. Prior selection procedures, in using tests such as the WISC-R and other verbally 
weighted standardized tests of crystallized abiUty, tended to identify a relatively homogeneous population 
of high achieving, well-motivated, verbaUy mature children. While many of these children are indeed 
of high potential, a significant proportion are simply verbally advanced due to the enriched expenences 
available to affluent children. In contrast to these verbaUy mahire children, there exists a significant 
number of disadvantaged children with extremely high potential, but limited achievement. 

In switching to the Raven Progressive Matrices Test, we began to identify children with high 
potential, but not necessarily high achievement. The result was greater diversity in the classroom, but 
new problems for teachers. To help teachers deal with this new diversity, we held seminars for teachers. 
However, much more effort wiU be needed if these more rulhiraUy diverse children are to be successfuUy 
integrated into the gifted program. 

In Volume 2, we present the initial drafts of research manuscripts that have resulted from this 
project. WhUe it is our intent to pubUsh each of these manuscripts in professional journals, we present 
them in Voliame 2 in order to provide rapid dissemination of our findings to interested readers. In 
Chapter 1 of Volume 2, we present our major findings pertaining to the Raven. Chapter 2 represents a 
doctoral dissertation pertaining to the WISC-R written by Nancy E. Johnson in partial fulfillment of the 
requirements of the Doctor of Philosophy degree. 

Chapter 3 provides an overview of data relevant to risk factors. In Chapter 4 we present our 
major findings pertaining to information-processing. Chapter 5 discusses our results relevant to locus 
of control. FinaUy, Chapter 6 provides a discussion of the characterisucs of gifted underachievers. 

Our primary limiting factor is that handing for this grant ended on 12/31/93. Given the 
considerable amount of data that we have accumulated, we wiU continue to analyze and dissemmate 
our findings during the next few years. 
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APPENDIX I 



San Diego City Schools 
Educational Services Division 
Gifted and Talented Education 



TFACHER NQ MTN ATTON FORM 



Date 



j^an^e Birth Date Sex ^Ethnic Code. 

g^ool Grade ^Track Room Number_ 



f^OCIAL/ENVTRONMEN TAT . VARIABLES 
Please check all items that apply: 

Lacks preschool/kindergarten experience 

Irregular attendance 

Transiency (3 or more school moves) 

Limited home enrichment opportunities (avaUabiUty of books, periodicals, 

family interaction, family outings) 
Home conflicts: 



2. 




Responsibilities and study time 
Excessive child care responsibility 
Working to help support family 
Overcrov^ding — no study area 
Inconsistencies in the home 



Economic hardship 

Single parent head of household 

Unemployment 

3. LANGUAGE 

Primary language of parent and/or student is ether than English 

Not proficient/fluent in English 

Uses non-standard English 

Student enrolled in Second Language Immersion Magnet (SLIM) 

4. CTILTURAL 

Limited home/school communication 

Experience in dominant culture is limited 

Cultural values and beliefs differ from dominant culture 
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SOCIAL/E^^VlRO^IMENTAL VARIABLES 
Page 2 



5. <;or^A V/KMOTTONAL 

Child abuse: physical mental neglect 

Emotional/adjustment problems 

Working with district counselor 

Working with social worker 

Utilizing psychological services 

Other:. 

Significant home factors 

Separation 

IXvorce 

Death 

Extended absence of parent 

Military 

Employment 

Other: 

Family 

Single parent 

Remarriage /step-parent 

6. HEAUH 

Designated instructional services 

PHDIS 

Speech and language 

Vision 

Hearing 

Adaptive RE. 

Severe allergies 

Asthma 

Frequent medical/health referral 

Regularly prescribed medication 

Other: 

Prepared by Recommended? Yes. 

(Teacher) 



Reviewed by. 



Recommended? Yes. 



APPENDIX II 



Student Name: 



(Last) 
Address. 



(City) 
Grade. 



San Diego City Schools 
School Services Division 
Gifted and Talented Education 

<;tI JDENT/PAT^KNT INFORMATION FORM 

Date 

^Birth Date 



Sex__School_ 



(First) (mi) 



Mother's name. 



(Street) 



Father's name_ 



__Occupation. 
Work Phone_ 
_Occupation_ 



(State) (Zip) 
Room Number 



Track. 



Schools Attended 



Grade 



Work Phone_ 
Home Phone. 

Dates Attended 



1. Names and ages of brothers and sisters:. 



2. Describe your child's attitude toward school:. 



3. List any special interests, talents, and skills your child may have:. 



4. \^at special lessons, traming or learning opportunities has your child had outside of school? 



5. To help us know more about your child, please check any of the following that apply: 



□ allergies 

□ asthma 

□ frequent absences 

□ prescribed medications 

□ parent in military 



□ frequent parent absence 

□ parents separated 

□ single parent 

□ remarriage /step-parent 

□ recent death /significant 
illness in family 



□ 3 or more schools attended 

□ no kindergarten or pre- 

school experience 

□ additional language(s) 
spoken in home 



List:. 



6. Has your child been previously assessed? □ yes □ no If yes, when? 

7. What other things would you like us to know that would assist us in assessing your child? 



Name of person Relationship 
completing this form shident_ 
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APPENDIX III 



In Search of An Equal Access Model: 
Review of Saccuzzo's "Identifying Underrepresented 
Disadvantaged GATE Children" 



Margie K. Kitano 
San Diego State University 



— Invited article submitted to Gifted Child Quarterly 



Abstract 



"IdexiUfying Underrepresented Disadvantaged Gifted and Talented Children: 
A Multifaceted Approach" is a collaborative researcli and development project 
that examined alternative procedures in search of an equal access model for 
identifying imderrepresented gifted students. A review of the project was 
conducted using interviews with key participants and project-generated 
manuscripts and pubUcations. The project analyzed archival and current data 
on some 35,000 shidents to evaluate a large, urban school district's selection 
process from referral to certification as gifted. Based on the findings, a new 
model designed to increase the proportion of ethiucally and economically diverse 
students referred for assessment and identified as gifted was implemented and 
evaluated. The new model incorporates the Raven Progressive Matrices as the 
criterion measure of inteUectual abiUty. Project data demonstrate that the new 
model has increased the number and proportion of underrepresented students 
referred and selected for gifted programs but has not met the criterion for 
providing equal access. Use of the Raven has led to certification of shidents 
with high cogrutive ability, although not necessarily commensurate academic 
achievement, and to a decrease in the proportion of mainstream students certified 
as' gifted. The review found diverse perspectives on the interpretation of these 
outcomes. It is clear that the project has had major impact on district practices 
and has increased access for gifted shidents from ethnicaUy, linguisticaUy, and 
economically diverse backgrounds. 



In Search of An Equal Access Model: 
Review of Saccuzzo's "Identifying Underrepresented 
Disadvantaged GATE Children" 

In 1982, what is now the eighth largest school district in the nation observed a significant racial 
imbalance in its gifted and talented education (GATE) program and set a course for change. The 
district had an early awareness of the growing ethnic and linguistic diversity of its population and the 
underrepresentation of nonmainstream students receiving services for the gifted. To find more equitable 
identification and selection procedures, the GATE leadership conducted a comprehensive search of 
extant literahire and consulted other large urban districts, state and federal departments of education, 
and national leaders in the field- Concurrently, the GATE program instituted a central nominating 
process and inservice opportunities for teachers and administrators, requested that a team of school 
psychologists be assigned to the GATE program, and sought to diversify its staff. Risk factors (e.g., 
environmental disadvantage) that impact test taking performance were added to standardized test 
criteria for certification as gifted. 

These changes significantly increased the numbers of diverse students served in programs for 
the gifted. However, they continued to be underrepresented relative to their proportions in the district. 
MeanwhUe, the state's recession-related financial crisis combined with competing concerns voiced by 
unions, community advisory groups, parents, and educators seriously threatened to dismantle the 
GATE program. Within this context, the Javits-hmded project titled "Identifying Underrepresented 
Disadvantaged Gifted and Talented Children: A Multifaceted Approach" began in the faU of 1990 to 
estabUsh an equitable identification procedure with strong empirical documentation. According to the 
senior GATE administrator, had it not been for the timely implementation of the Javits project, "there 
would be no gifted program in this school district" today. 

This article provides a review of the project based on two major sources of information. First, 
manuscripts, publications, and documents from conference presentations given by project and school 
district staff were examined (Saccuzzo, 1993; Saccuzzo & Johnson, 1992, undated; Saccuzzo, Johnson, 
& Guertin, 1993; Saccuzzo, Johnson, & Russell, 1992). Second, the reviewer conducted face-to-face or 
telephone interviews with the three key project staff members and with school district personnel 
including the senior GATE administrator, three GATE school psychologists and counselors, and four 
elementary and secondary school principals. The interview findings suggest that project activities and 
findings c^ be objectively described. Further, most informants agree that the project has dramatically 
impacted school district policies and procedures with regard to increased enrollment of students from 
ethnicaUy diverse backgrounds in programs for the gifted. At the same time, these changes have 
generated healthy debate regarding what constitutes an equitable identification process as weU as 
what constitutes gifted potential. This review begins v;ith a description of the project and its fmdmgs 
and then examines multiple perspectives regarding project outcomes and future chaUenges. 

Project Description 

"Identifying Underrepresented Disadvantaged Gifted and Talented Children: A Multifaceted 
Approach" represents a 3-year cooperative research and development effort between a large urban 
school district and a comprehensive state university. Deimis R Saccuzzo, professor of psychology at 
San Diego State University, serves as prinicipal investigator and director. The San Diego Unified School 
District's Gifted and Talented Education program, under the leadership of David R Hermanson, 
functions as the coUaborating partner. The project focuses on evaluating strategies for identifymg for 
Kifted programs shadents traditionally underrepresented in such programs; coUateraUy, it considers 
Ae realistic needs of a highly diverse school district in a state undergoing severe economic stress. The 
district serves over 123,000 shadents, of whom approximately 29 percent are Latino, 38 percent white 
non-Hispanic, and 18 percent African-American. The remaining 15 percent are of other backgrounds, 
primarUy FiUpino, Indochinese, Asian, Pacific Islander, and American Indian. This review was 
conducted in the latter half of the project's third and final year. 
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Purposes 

The project has as its primary purpose to determine whether an objective, reliable selection 
model can be developed that will provide equal access to gifted programs for children of all ethnic and 
economic backgrounds. Equal access is defined as selection of students in each ethnic or cultural group 
in proportion to their numbers in the school district as a whole. The project also sought to enable the 
school district to identify and select for gifted programs greater numbers and proportions of 
underrepresented and disadvantaged students as defined by Javits legislation. 

Three addibonal goals fociis on exploring characteristics of underrepresented students on various 
assessment devices and evaluating the potential of nontraditional measures for identifying students as 
gifted. Specifically, project staff have analyzed archival data on 20,000 chUdren referred for gifted 
assessment since 1984 to test hypotheses about ethnic differences in patterns of intellectual strengths on 
the WISC-R. The data bank now includes information on some 35,000 elementary to . -^h school-age 
students. Additionally, thev sought to test the efficacy of a battery of nontraditional micro-computerized 
information-processing tasks and determine the information-processing characteristics of ethnically 
diverse students. Finally, they investigated the potential of a locus of control measure for identifymg 
giftedness in target students. 

Procedures and Findings 

The purposes and goals are being implemented by the project's director, post-doctoral feUow, 
and coordinator with the assistance of thirty-four university students who have served as research 
assistants. Three types of data were analyzed: standardized test data on students referred smce 1984 as 
potentially gifted; numbers of students from various ethnic groups referred, nommated, and certified 
over the project's duration as gifted; and performance scores on a battery of nontraditional assessment 
devices. Findings related to each of the five project purposes are described in the followmg sections. 

Evaluating the efficacy of pre-project identification measures: tJie WISC-R. Prior to the project's 
implementation, students were nominated for evaluation by teachers, principals, parents, or self or 
through performance on the California Test of Basic Skills. Nominated students received group 
administration of the Developing Cognitive Abilities Test (DCAT). School psychologists administered 
an individual intelligence test, most frequently the WISC-R, to those students who scored at the 90th 
percentUe or above on the DCAT Because the WISC-R surfaced as the predominantly used criterion 
measure for certification as gifted in the district, the project examined the potential of the instrument for 
unbiased selection. Archival WISC-R data were analyzed for a subset of elementary students referred as 
potentially gifted from African-American, Asian-American, Caucasian, Filipino and Hispanic 
backgrounds. The project found no single model or combination of individual models usmg WlbC-K 
subtest scores that could select equally from each of the five groups. The researchers concluded that no 
single model (i.e , not using ethnic norms) of WISC-R scores will result in an equal access program. The 
data suggest that the WISC-R overselects Causcasian and underselects African-American and Latmo 
students for gifted programs (Saccuzzo & Johnson, 1992, Presentation #4). 

Finding an equal access model and increasing the numbers and proportions of diverse students identified 
as snfted During the first implementation year, the project found that one barrier to equal access .xcurred 
durirte the refe^al and nomination stages. Specifically, school personnel and parents were not nommatmg 
Africa-American and Latino students in proportion to their numbers in the district as a whole. Project 
staff systematicaUy gathered, analyzed, charted, and disseminated to district personnel data on the 
proportion of students tested by ethnic group. In collaboration with GATE persom^el, a program was 
implemented to monitor and share nomination data, provide teacher inservice, and actively engage 
GATE psychologists in soUciting proportionate nominations. The program led to a steady mcrease m 
tiie proportions of African-American and Latino students tested for GATE programs and a decrease m 
the overrepresentation of Caucasian students. 

In addition, given the project's findings on the WISC-R, the district added the Raven Progressive 
Matrices (RPM), 1986 American norms, as one of the standardized tests that could be used to certify a 
shident as gifted. The RPM (Raven & Summers, et al, 1986) is an untimed, 60-item non-readmg test of 
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general mental ability. Each item presents a design with a part missing and several choices from which 
the missing piece can be selected. The RPM can be administered in a group or individual setting. As 
the project progressed, the DCAT was eliminated as a screening measure and a psychologist- 
administered RPM adopted as the prescribed certifying tool. A score of 98 percentile, or 90 percentile 
with two or more risk factors, was retained as the criterion for documenting intellectual ability (see San 
Diego City Schools, 1992 for additional information). Project data demonstrate tiiat, combined witii 
increased ethnic nominations, the shift to the RPM dramatically raised tiie number of diverse students 
enrolled in GATE programs. For example, the number of Latino students enrolled increased 70-fold 
from 1989 to 1992 and the number of African- American students 8-fold over the same period (Saccuzzo, 
1993). Using the RPM, \he project identified for gifted programs over one thousand African-American 
and Latino students who would not have been selected through WISC-R performance. 

Proportional representation in enrollment in programs for the gifted also improved. 
Proportional representation is calculated by dividing the proportion of students enrolled from a given 
group by their proportion in the district as a whole, such ihat 1.0 represents ihe standard criterion. 
Prior to the project's implementation, the representation of white students was 1.5 (i.e., they were 
overselected for gifted programs by .5 relative to their proportion in the district as a whole). The 
representation of Latino students was 0.5 (underselection by .5), and of African-American students 
0.33 (underselection by .67). At ihe time of this review, project data show the proportion of white 
students enrolled in gifted programs as 1.2, Latinos as 0.8, and African-American students as 0.7. 
Additionally implementation of ihe project model has increased ihe proportionate representation of 
Filipino, Indochinese, Pacific Islander, and American Indian students selected for gifted programs and 
has produced a slight overrepresentation of Filipino and Asian-American students. 

The project staff also evaluated records of over 20,000 second- through sixtii-grade students in 
the data base who bad been administered eiiher the RPM or WISC-R or both. These data were correlated 
with language, reading, and rnath scores for ihe same students on ihe Califonrda Test of Basic Skills, 
DCAT, and Abbreviated Staiiford Test of Achievement. Results suggest that ihe RPM provides a better 
predictor of academic achievement, including language achievement, than the WISC-R verbal, 
performance, or full scale IQ scores for African- American and Latino students (Saccuzzo & Johnson, 
1992, Presentation #5). 

Assessing the potential ofnontraditional instruments for identifying gifted students. The project also 
evaluated the efficacy of the Nowicki-Strickland locus of control measure (Saccuzzo & Johnson, 1992, 
Presentation #1) and a battery of micro-computerized information processing tasks for use in selecting 
diverse students as gifted (Saccuzzo & Johnson, 1992, Presentations #2,3; Saccuzzo, Johnson, & Guertin, 
1993). Included in the latter were information processing tasks that measured speed of processing as 
well as tasks not dependent upon speed. In general, these studies produced information about 
differences and surilarities by ethiuc and by abiUty (gifted/nongifted) group. The project team 
concluded that, while some of these measures have potential more work needs to be done in this area. 

Project Outcomes: Multiple Perspectives 

This section examines informants' various interpretations of the project's findings and impact. 
Diverse perspectives were found on two major issues: (a) whether the project successfully identified 
an equal access identification model using a single standardized measure and criterion across ethnic 
groups; and (b) whether the students identified through the model are "gifted." 

Does Proportionate Nomination and the RPM Produce Equal Access? 

The project's major purpose was to determine whetiier a selection model could be developed 
that would provide equal acc ess to gifted programs for students of all ethnic and econonuc backgrounds. 
When asked whether the Raven Progressive Matrices provided an equal access selection model. Project 
Director Dennis Saccuzzo responded "No, if \he RPM is used in a standardized format witii an ethnicaUy 
and economically diverse population. Use of the RPM improves access, but does not produce equal 
access." Project staff have concluded from their data that attainment of proportionate representation 
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across ethnic and economic groups through monitoring and inservice for nomination to gifted programs 
constitutes a realistic, attainable goal. Moreover, proportionate nomination in conjunction with selection 
through use of the RPM significantly increased the numbers and proportion of ethnically and 
economicaUy diverse shidents enrolled in programs for the gifted, thus meeting the second primary 
purpose of the project. However, they evaluate the model implemented in the district through the 
project as not meeting the objective of producing equal access to gifted programs, smce there contmues 
fo be significant underrepresentation (by .3) of African-American students relative to *eir proportions 
in the district as a whole. According to the research team, blanket use of the RPM or any other 
standardized psychometric test will not produce equal access. They hypothesize that use of tne RPM m 
combination with dynamic assessment (Feuerstein, 1979) may lead to an equal access model. 

GATE personnel, under pressure from the district and community to increase the number and 
proportion of ethnicaUy diverse shidents in gifted programs, offer a different perspective on project 
outcomes. When asked if the goals have been met, the senior school psychologist rephed. Yes; very 
nicely especiaUy if you consider the entire process from nomination to certification. I was skephcal m 
the begitming but have been thoroughly won over by the results. All of us feel comfortable gomg out 
anywhere and defending the practice." 

Both project and GATE personnel report that some members of the community have been 
vocally critical of project outcomes. For example, despite the continuing overrepresentahon of white 
nonH^panic students selected for gifted programs, the proportion of white students nommated and 
identified has declined with the implementation of the new model. For this reason, some parents argue 
that the RPM is biased against mainstream students. One principal noted that parents whose chUdren 
qualify ca the WISC-R but fail to qualify on the RPM (and therefore for the GATE program) queshon 
the mo iel's validity. 

Project and GATE personnel also report that members of the African-American community 
have objected to the model because standardized tests traditionaUy have been used to discriminate 
against their children and because the RPM has not eliminated bias in selection. On the other hand, the 
dramatic increase in number and proportion of African-American and Utino students identihed through 
the project has produced many positive comments. The principal of a school with high enrollments o 
Latino students (88%) and studenvs whose first language is not English (50%) reported a substanhal 
rise in the number of shidents from these groups served by GATE and "no complaints from parents. 
Theprincipalofaschoolwhoseshidentsare39peicentAfrican-Americanand29peicentl^tmode^ 
the RPM as an "exceUent instrument" compared to the Wechsler and Binet and offered that parents 
have been extremely pleased and positive." 

Does the Model Identify Gifted Students? 

Informants agree that the combmation of increased nomination and use of the RPM as the 
criterion measure has raised the numbers and proportions of traditionally underrepresented shidents 
in the GATC program. Opinions differ regarding the nature of shidents identified as gifted by the 
Ravens as opposed to the Wechsler. The question has arisen because some of ttie shidents certified by 
the Ravens display achievement characteristics different from those certified by language-Jased teste. 
SpecificaUy, the earUer nomination practices and use of verbaUy weighted este, such as the WIX-R 
tended to identify high achieving, academicaUy motivated shidente. Implemenhng the new modd 
using the RPM leads to identification of shidente with high cognitive potential but not necessarily high 
academic achievement. 

The academic, culhiral, and economic diversity exhibited by shidents 
project's model has presented new challenges to GATE teachers. TT.e pnncipal of one ethmcaUy diverse 
SreSSS that the teachers consider the Ravens "a step oackward" because so many of the shidente 
Sd^omSe academic skills and are "not ready for GATE materials 
sut2Sat''abatterymaybemoreappn,priate.I'mnotsureifwear.tnJy^^^ 

sE^The RPM maV be identifying shidente whose homes have provided expenence with puzzles. 
. . it identifies more boys than girls, perhaps because it measures spatial abihty. 
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The principal of a school with high African- American and Latino enrollments found the GATE 
teachers "extremely positive" about the changes. Nevertheless, one teacher at this school experienced 
extreme distress from working with "too many who didn't belong." The principal of the school with 
predominantly Latino enrollment "had no questioi\s from teachers." 

Interestingly, these principals reported different uses of the RPM. For example, the first principal 
quoted above has the RPM administered to all second- and fifth-grade students. In other schools, the 
RPM is administered only to those students noLiinated by school personnel and parents. The principal 
of a school with high enrollments of Latino and Indochinese students reported difficulty in assessing 
changes resulting from use of the RPM since identification of gifted students at this school "continues 
to rely on several versus one measure." 

Based on the testing literature, project staff evaluate the RPM as a "far better measure of pure 
potential than tests such as the WISC-R, whose scores depend heavily on acquired knowledge" (Saccuzzo, 
1993, p. 14). GATE staff suggest that "Teachers need to understand that they're getting excellent thixOcers 
versus achievers." 



Triumphs and Remaining Challenges 



Triumphs 

Despite the diversity of perspectives, this project cleariy has had striking impact on shidents 
from groups traditionally underrepresented in programs for the gifted. Most significantly, the project 
resulted in immediate implementation of a new model for nomination and certification that in turn 
produced significant increases in the numbers and proportions of diverse students referred and enrolled 
in gifted programs. The project has produced additional data on nontraditional measures that may 
hold promise for identification of giftedness across ethnic groups. 

Interviews provided insights regarding benefits not predicted by the proposal. As stated in the 
introduction, the senior GATE administrator attributes the retention of the district's GATE program to 
the project's development of a model that substantially increased access and provided strong theoretical 
and empirical justification. While not as dramatic, other benefits accrued that promise long-range 
impact. For example, project staff indicated that over thirty university shidents received direct experience 
in research related to improving the identification of gifted students from diverse ethnic and economic 
backgrounds. The project led to several graduate theses and dissertations that may stimulate further 
t asearch in this area. The project has amassed a data base that includes more African-American, Latino, 
and Indochinese students than included in the standardization sample of the WISC-R. A principal 
reported that compared to the Wechsler, after which students "felt frustrated," taking the Raven "boosted 
students' self-esteem" irrespective of their performance on the measure. Project staff observed that, 
after nontraditional students are certified as gifted by the Raven, teachers often begin noticing these 
students' positive behaviors. The project is both "chipping away" at stereotypes regarding ethnicity 
and intelligence and giving access to bright students who otherwise would remain undiscovered and 
lost in the system. 



Challenges 

As with any worthy but time-limited enterprise, the project has produced challenges for 
researchers and practioners alike. Project staff point to a need to continue the search for a selection 
model that provides equal access to gifted programs for shidents from all ethnic, linguistic, and economic 
groups. They suggest that dynamic assessment ultimately may provide one alternative. Additionally, 
it would be interesting to follow trie progress of nontraditional shidents identified for gifted programs 
to investigate long-term impact on these shidents. GATE staff perceive teacher and administrator 
inservice as a critical need to promote (a) understanding of the many manifestations of gifted potential 
and (b) strategies for accommodating gifted shidents who initially display heterogeneous achievement 
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levels As one principal cormnented, "We need to change attitudes as the population demographics 
have changed; teachers' styles of teaching need to change. In addition, conununity and parent awareness 
programs will support broader understanding of assessment issues and student needs. 



Conclusion 



In less than three years, the project "Identifying Underrepresented Disadvantaged Gifted and 
Talented ChUdren: AMultifaceted Approach" has directly affected the Uves of over one thousand newly 
identified ethnically and linguistically diverse gifted students who otherv^'ise would not have received 
special services. Moreover, the project's contribution to the insUtutionalization of a more equitable 
selection model in the nation's eighth largest school district will continue to impact subsequent cohorts 
of traditionally underrepresented gifted students as they enter the system. The results demonstrate the 
critical role played by federal funding of discretionary projects. 

The project's si.-v ess also attests to the importance of school-university coUaboration in the 
generation and implementation of new knowledge. Rapid change occurred through the interaction of 
highly competent researchers and far-sighted GATE leaders who recognized the need for research-based 
practice and were willing to take risks to improve access. Both project and GATE staff predict tliat the 
results are generalizable to other large districts serving similarly divorse populations. In hght of literature 
on multiple inteUigences (e.g.. Gardener, 1983) and the need to use multiple measures and avoid cutoff 
scores (e g Richert, 1991) especiaUy when assessing diverse students, writers in the field wiU question 
the use of a'single, standardized instrument for identification. Yet as the GATE administator suggested, 
"We welcome every urban school district to visit and review our program and would welcome an open 
debate of procedures used." 
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Abstract 



The efficacy of use of the Standard Raven Progressive Matrices 
Test (RPM) in the selection of gifted children from traditionally 
underrepresented groups was invesHgated in a large-scale study with a 
diverse population. A total of 16,985 Raven subjects included 22.7% Latinos, 
37% Whites, 14% African- American, 2.8% Asian, 8.4% Filipinos, and 5.6% 
Indochinese, each of whom had been identified potentially gifted based on 
a case shidy analysis by a school psychologist. The resultant sample of 
children certified gifted based on Raven performance was compared with 
a group certified gifted based on individual administrations of the Wechsler 
Intelligence Scale for Children - Revised (WISC-R). The Raven gifted sample 
was also compared to achial enrollment ratios for the school district to 
ascertain if equity in representation could be achieved for gifted programs 
using the Raven. Although chi square results indicated that use of the RPM 
as opposed to the WISC-R led to increased equity for all groups, full equity 
was not achieved. When compared to the district population, the Raven- 
selected group showed underrepresentation for Latinos and African- 
Americans and overselectiori for Whites, Asians, and FiUpinos. Chi square 
comparisons based on the expectation of equal gender representation 
revealed that the WISC-R disproportionately overselected boys, while the 
RPM showed no gender differences. The findings support the position 
that the RPM is a more equitable test than the WISC-R for ethnically diverse 
gifted populations and for girls. 
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Use of the Raven Progressive Matrices Test in an Ethnically Diverse Gifted Population 

Professionals in the field of Gifted Education continue to express considerable interest in 
developing alternatives to traditionally used standardized tests in identifying gifted students. Such 
alternatives are particularly needed for stiidents from ti'aditionaUy underrepresented groups such as 
African-Americans and Latino /Hispanics. One such alternative, repeatedly mentioned in the Uterature, 
is the Raven Progressive Matiices (RPM) Test (Raven, 1958, I960; Raven et al., 1986). A number of 
investigators have pointed to the RPM as a culturaUy reduced measure of cognitive ability wi\h 
considerable promise as a measure of giftedness in braditionaUy underrepresented and cultiarally diverse 
students (Baska, 1986; James, 1984; Powers & Barkan, 1986; Powers, Barkan, & Jones, 1986). 

The standard form of tfie RPM consists of 60 mabix problems, which are separated into five 
sets of 12 designs each. Within each set of 12, \he problems become increasingly difficult, and each of 
tiie five sets is progressively more difficvilt. Each individual design has a missing piece. The stiident's 
task is to select the correct piece to complete \he design from among six to eight alternatives. Correct 
responses are based on various organizing principles such as increasing size, reduced or increased 
complexity, and number of elements. 

Because RPM stimuH are visually presented, it is easy to mistake tixe test as one of visual 
perception or spatial reasoning. It is neither. As Cherkes-Julkowski, Stolzenberg, and Segal (1990) 
have noted "The Raven is as close to a stiidy of pure thinking processes in the absence of the influence 
of specific content acquisition as is available" (p.7). As Snow and Colleagues have shown "smg "dex 
and liierarchical models, the RPM is the best available measure of general mtelligence (Marshalek, 
Lohman, & Snow, 1983; Snow, KyUonen, & Marshalek, 1984). As a measure of general inteUigence the 
RPM correlates highly with verbal measures of ability, even though the stimuli themselves are completely 
nonverbal (Carpenter, Just, & Shell, 1990). In fact. Positron Emission Tomography (PET) Scans, which 
produce computer generated images of the brain, have shown that the entire brain is involved m solvmg 
RPM problems, with the three most used areas being the right cerebral hemisphere, the left temtporal 
lobes, and the left frontal lobes (Haier, Siegel, Nuechterlein, Hazlett, Wu, Paek, Browning, & Buchsbawn, 
1988). The left temporal lobe involvement is most likely due to tiie use of verbal codes m solvmg Kl M 
problems. 

Because its stimuU are nonverbal, \he RPM can be administered fairly in American schools to 
individuals who speak a language other than English. Because stimuU are visuaUy presented, rather 
than spoken, they are not transitory. Thus, the stimulus remains in front of the stiident, which reduces 
the role of memory and even attentional factors in performance (Cherkes-Julkowski et al., 1990). bolving 
RPM problems does not depend heavily, as do aU language based tests, on acquired knowledge, specific 
cultural experiences, or reading abiUty. As Carpenter et al. (1990) have noted, "The Raven measures 
the ability to reason and solve problems involving new information, without relying extensively on an 
expUcit base of declarative knowledge derived either from sdiooling or previous experience. In sum, 
the Raven measures general inteLUgence and correlates with measures of linguistic abiUty. It uses 
nonverbal stimuli and does not require a specific knowledge base. 

Previous investigations have found that tlie Raven has not only been effective in identifying 
traditionaUy underrepresented children for gifted programs, but also correlates with their success (Baska, 
1986). In one stiidy. Powers, Barkan, and Jones (1986) found no significant differences betw^n Hispanic 
and Anglo-American chQdren's mean scores, score variabiUty, and test rehabUity ^^^^f^^^-?^'' 
Sadies have supported the vaUdity of the RPM for Hispanic (Powers & Bark;an, 1986) and Navap 
stiidents (Sidles & MacAvoy, 1987). 

As Raven (1989) has noted, the Raven Progressive Mab-ices Tests have been used in over 1,600 
published psychological shidies (see Court, 1988; Court & Raven, 1977, 1982), making the RPM among 
Cmost researehed psychological tests. Until recently, however, the RPM has not been used widely in 
appUed clinical and educational settings. A major problem had been the lack of adequate U S. norms 
(Splan & Saccuzzo, 1989, 1993). An extensive and relatively current ° r^'^^;^^^^^ 
as weU as worldwide norms, is now available (Raven, Summers et al., 1986). More than 30,000 stiidents 
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aged 5 to 18 were chosen to be representative of school districts across the United States in approximately 
30 nonning shidies. Ethnicity and socioeconomic factors made independent contributions to the 
variance. Ethnic differences, which were attributed to differences in birth weight, infant mortality, and 
the incidence of serious childhood illness, showed a decline compared to earlier reports (Burciaga, 
1973; Hoffman, 1983; Jensen, 1980). There were, for example, no major Hispanic /White differences in 
the Ontario-Montclair School District of California. Moreover, the RPM was found to have equal 
predictive validity within each group (Hoffman 1983, 1986). 

It should be noted that while ethnic differences may be declining, there remain differences in 
the mean scores as a function of ethnicity as well as socioeconomic backgroimd. Nevertheless, the 
question remains as to whether the RPM can be used to achieve a more equitable selection of students 
than can be obtained with more widely used traditional tests, such as the Wechsler Intelligence Scales. 

In the present study we present the results of a Jacob Javits grant whose purpose was to evaluate 
the efficacy of the RPM in the selection of traditionally underrepresented groups for gifted programs. 
The data were collected over a three year period and include an archival data pool of over 5,000 
administrations of the Wechsler Intelligence Scale to children from 8 major ethnic backgrounds. 



Method 



Subjects: 

The subjects were 16,985 children who were referred and evaluated for the gifted program at 
San Diego City schools between the Fall of 1991 and the Spring of 1993. Students were classified into 
one of eight ethnic backgrounds based on self-report of the parent as follows: 3,864 Latino/Hispanic, 
6 286 White, 2^89 African- American, 483 Asian, 75 Native-American, 104 Pacific Islander, 1,419 Filipino, 
and 958 Indochinese. There were 1407 classified as "Other". Of the 16,985 subjects, 51.5% (8,740) were 
female, 48 5% (8,245) male. The distribution by grade level was as follows: 24 first-, 7,664 second-, 
1,467 third-, 819 fourth-, 3,737 fifth-, 748 sbcth-, 2,122 seventh-, 263 eighth-, and 90 ninth-graders. There 
were 51 cases where data on grade was missing. 

As a comparison sample, the files of all children evaluated for giftedness in the San Diego City 
School System between 1984 and 1990 were examined. During this time period, the district had used 
the WISC-R as the primary tool for identifying giftedness. A total of 9315 students had been given the 
WISC-R during this time period. 

A second point of comparison was based on actual enrollment figures by ethnic background 
during the course of the study The average enrollment ratio in the district as a whole by ethnic 
background between 1991 and 1993 was as follows: 27.2% Utino/Hispanic, 37.2% White, 16.2% African- 
/onerican, 2.3% Asian, 0,6% Native- American, 0.7% Pacific Islander, 8% Filipino, and 7.7% Indochinese. 

Procedure: , ^ j j / u 

Children were given either a WISC-R (entire 1984-1990 sample) or a Standard Raven (entire 
1990-1993 sample) by a district school psychologist. The WISC-Rs were individually administered; the 
Raven was group administered. As a part of the evaluation process the school psychologists conducted 
a case study analysis of each child to evaluate for the presence of risk factors and level of achievement 
(see Saccuzzo, Johnson, & RusseU, 1992). Five risk factors were considered. These were: cultural/ 
laneuaee, economic, emotional, environmental, and health. Achievement was evaluated m terms of 
standard scores on the California Test of Basic Skills (CTBS) or the Abbreviated Stanford Achievement 
Test (ASAT). 

ChUdren were certified as gifted if they obtained a Full Scale IQ of 130 on the WISC-R or an IQ 
equivalent of 130 on the Raven (i.e., achieved a score in at least the 98th percentile) based on \he Smootiied 
U S Norms reported by Raven et al. (1986). In addition, children who had two or more of the nsk 
factors were certified as gifted if they had a Full Scale WISC-R IQ of 120 or Raven IQ equivalent of 120. 
Finally, children who obtained a WTSC-R or Raven score of 3 standard deviations above tiie mean were 
placed in a special ''Seminar" program. 

3f; 



Results 



Figure 1 illustrates the ethnic composition of the district for all eight ethnic backgrounds 
compared to all children who were certified as gifted with the WISC-R (entire 1984-1990 sample) and to 
all children who were certified as gifted with the Raven (entire 1990-1993 sample). Inspection of Figure 
1 reveals that the RPM led to increased equity for all ethnic groups. For example, only about 20% of the 
expected number of Utino/Hispanic were selected with the WISC-R, while about 80% of the expected 
were selected using the RPM, For the Whites, about 200% of the expected were selected with the WISC- 
R, while only about 120% of the expected were selected with the RPM, Pacific Islanders, Native- 
Ainericans, and Indochinese were all greatly underselected by the WISC-R, but selected according to 
the expectation based on their numbers with the Raven. Filipinos went from about 60% of expectation 
with the WISC-R to about 120% with the Flaven, African-Americans went from about 40% of the 
expectation to over 60%, Thus, the RPM provided a more equitable distribution for all ethnic 
backgrounds. 



Figure 1 , Ethnic composition of those certified gifted on the basis of WISC-R scores (1984-1990) 
versus Raven's Progressive Matrices scores (1990-1993), 
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Ethnicity 



To demonstrate statisticaUy the superiority of the RPM over the WISC-R in tenns of equity, we 
used the foUowing procedure. Chi Squares were computed for each ethnic group using the number of 
children certified by use of the WISC-R in each ethnic background as the expected and the number of 
children in that ethnic background selected by use of the Raven as tl^e observed. Table 1 provides a 
summary of the findings. As Table 1 shows, there were significant differences in the dire ction of increased 
equity for all groups but Native-Americans (where the n was smaU) and Asians. 



Table 1. Chi Square results using children certified with the WISC-R as the expected 
and children certified with tlie Raven as the observed 



Ethnicity 




n (Expected) 


n (Observed) 


Chi Square 


Latino/Hispanic 




413 


1304 




White 




5535 


3441 


792.20*** 


African- American 




454 


714 


148.90*** 


Asian 




260 


306 


8.14 


Native American 




20 


36 


12.80 


Pacific Islander 




10 


50 


160.00*** 


Filipino 




321 


713 


478.70*** 


Indochinese 




112 


513 


1435.72*** 



p<.05 
p<.01 
p<.001 



Despite the substantial improvement in equity when the RPM was used, inspection of Figure 1 
indicates that full equity was not achieved. To verify this observation statisticaUy Chi Squares were 
again computed using the proportionate number of children in each ethnic background as the expected 
compared to the number actually certified with the Raven as the observed. 

Table 2 presents a summary of these findings and reveals significant discrepancies for all groups 
except Native-Americans, Pacific Islanders, and Indodiinese. Latino/Hispanics and African-Americans 
were significantly underselected compared to the expectation while Whites, Asians, and Filipinos were 
significantly overselected. 



Table 2. Chi Square results comparing the actual proportion of children in each ethnic 
background with the number actually certified with the Raven 



Ethnicity 

Latino /Hispanic 
White 

African-American 
Asian 

Native American 
Pacific Islander 
Filipino 
Indochinese 



df 



p<.05 
p<.01 
p<.001 



n (Expected) 


n (Observed) 


Chi Square 


2009 


1351 


215.51*** 


2743 


3568 


244.69*** 


1200 


766 


156.96*** 


170 


319 


130.59*** 


41 


39 


0.10 


55 


48 


0.89 


595 


739 


34.85*** 


569 


532 


2.41 
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To examine possible ethnic differences on the Raven independently of risk factors, we used the 
following system. First, we considered only those children who scored in the 98H\ percentile or better 
on the Raven. Next, we determined the number of children in the school district in each ethnic background 
that would represent 2 percent of that group. For example, given the total number of Latino /Hispanics 
in the district, 905 would be in the top two percent. The actual nimiber of children in the tcp two 
percent for each ethnic background was used as the expected, while the number of children in eachi 
backgromid who actuaUy scored in the upper 2% (i.e., 98th percentile or better) was used as the observed 
in a s^?ries of Chi Square analyses. Table 3 summarizes these analyses and shows significant discrepancies 
for four groups: Latino /Hispanics and African- Americans, who were underselected; and Whites and 
Asians who were overselccted compared to the expectation. 



Table 3. Chi Square results comparing the actual number of children in the top 2% of each ethnic 
background with the number who actually scored in the upper 2% 



Etlinicitv 



Latino /Hispanics 
Whites 

African-Americans 
Asians 

Native Americans 
Pacific Islanders 
Filipinos 
Indochinese 



n (Expected) 


n (Observed) 


Chi Square 


905 


379 


305.72*** 


1238 


2009 


480.16*** 


541 


209 


203.74*** 


77 


173 


119.69*** 


18 


21 


0.50 


25 


12 


6.76 


268 


316 


8.59 


256 


199 


12.69 



♦ p < .05 
** p< .01 
*** p < .001 



To investigate the possibiUty of gender differences in WISC-R and Raven performance, the 
number of boys and girls who actuaUy scored in the 98th percentUe and above on each test in each 
ethnic group was compared to the expectation that there would be equal numbers of boys and girls. As 
can be seen in Table 4, there were no significant differences in performance for boys and girls of any 
ethnic background on the Raven. 



Table 4. Chi Square results comparing the number of boys and girls of each ethnic 
background who actually scored in the top two percent on the Raven 



Ethrdcity 

Latino /Hispanics 
Whites 

African-Americar^ 
Asians 

Native Americans 
Pacific Islanders 
Filipinos 
Indochinese 



Total n 


Boys (Observed) 


Chi Square 


374 


181 


0.19 


2002 


1057 


3.13 


206 


116 


1.64 


172 


79 


0.57 


21 


12 


0.21 


12 


6 


0 


315 


170 


0.99 


198 


100 


0.01 



* p < .05 
p < .01 
*** p < .001 
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For the WISC-R, by contrast, significantly more White boys (2632) than girls (2272) achieved 
an IQ score in the top two percent (Table 5). Moreover, the trend was in favor of boys on the WISC-R 
for every ethnic group except Pacific Islanders. 



Table 5. Chi Square results comparing the number of boys and girls of each ethnic 
background who actually scored in the top two percent on the WISC-R 



Ethnicity 


df 


n 


Boys (Observed) 


Chi Square 


La tino / Hispaiuc 


1 


265 


152 


2.87 


White 


1 


4904 


2630 


13.21*** 


African-American 


1 


252 


129 


0.07 


Asian 


1 


202 


109 


0.63 


Native American 


1 


13 


9 


0.96 


Pacific Islander 


1 


9 


3 


0.50 


Filipino 


1 


182 


107 


2.81 


Indochinese 


1 


61 


34 


0.40 



* p < .05 
** p<.01 
*** p < .001 



In order to examine performance of individuals at the highest level of abiUty (i.e., three standard 
deviations above the mean), it was necessary to generate local norms, since the norms provided in the 
manual only go up to the 99th percentile. In constructing local norms for the 99.1 through 99.9 percentile, 
we used the following procedure. Based on the table of smoothed North American norms provided by 
Raven, et al. (1986), we selected all tliose children in our shidy who had obtained a raw score in the 99th 
perceritUe for each age range in the table. We then examined the frequency distribution of raw scores 
for each age range and attempted to break the scores down into ten groups occurring with equal 
frequency, representing the 99.0 through 99.9 percentile. 

In the analyses that followed, the local norms (Guertin, Johnson, Saccuzzo, & Lopez, 1992) 
were used to identify shidents who scored three standard deviations above the mean (i.e., above the 
99 87 percentUe). Figure 2 Ulustrates the proportion of shidents from the entire 1990-1993 sample, by 
etimic background, who obtained scores at tiie 99.9 percentile on tiie local Raven norms versus the 
proportion, by ethnic background, who obtained WISC-R Full Scale IQ's of 145 or above. As mspechon 
of Figure 2 shows, there was an increase in the representation of children selected for the schools very 
sifted "Seminar" program for aU ethnic backgrounds except for the Whites, who represent about 35 
percent of the distiict. With the WISC-R, Whites represented more tiian 80% of tiie children selected for 
the "Seminar" program. Witii the Raven, Whites represented less tiian 60%. 
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Figure 2. Ethnic composition of the 99.9th percentile: WISC-R versus 
Raven's Standard Progressive Matrices 




To demonstrate statistically the superiority of the RPM over the WISC-R for selection at the 
highest level of ability, Chi Square analyses were computed for each ethruc background using the 
proportionate number of children selected by the WISC-R as the expected and the number selected by 
the Raven as the observed, as was done in Table 1. Results were sinular to those found for the children 
who scored above 130. In fact, there were significant increases in the direction of increased equity for all 
ethnic backgrounds except Asians and Native Americans (See Table 6). 



Table 6. Chi Square results using children certified "Seminar" with the WISC-R as the 
expected and children certified "Seminar" xoith the Raven as the observed 



Etlinicity 




n (Expected) 


ti (observed) 


Chi Square 


Latino/Hispanics 
Wliites 




24 
430 


69 
300 


84.38*** 
38.79*** 


African- Americans 




22 


33 


5.50* 


Asicms 




26 


33 


1.88 


Native Americans 




1.4 


3 


1.83 


Pacific Islanders 




1 


4 


9.00** 


Filipinos 
Indochinese 




13 
4 


49 
29 


99.69*** 
156.25*** 



p<.05 
p<.Ol 
p<.001 
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As with the children who scored two standard deviations above the mean, there were inequities 
for children who scored three standard deviations above the mean. A total of 520 children in our 
sample scored in the 99.9th percentUe on the Raven, as determined by local norms. We compared the 
number who actuaUy scored at that level with the number expected based on district proportion for 
each ethnic group. Results, summarized in Table 7, indicate that Latino/ Hispanics and African- 
Americans were underrepresented while Whites, Asians, and Indochinese were overrepresented. 



Table 7. Chi Square results comparing children certified "Seminar" using the Raven 
to their proportionate numbers in the district 



Ethnicity 

Latino / Hispanics 
Whites 

African-Americans 
Asians 

Native Americans 
Pacific Islanders 
Filipinos 
Indochinese 



n (Expected) 


n (Observed) 


Chi Square 


141 


69 


36.77*** I 


193 


300 


59.32*** 


84 


33 


30.96*** ■ 


12 


33 


36.75*** 1 


3 


3 


0.00 


4 


4 


0.00 _ 


42 


49 


1.17 1 


41 


29 


3.51 ■ 



♦ p < .05 
p < .01 
p < .001 

Figure 3 illustrates the gender composition for children who scored either at or above the 
99.9 percentile on the Raven or had a Full Scale WISC-R IQ of 145 or greater. 

Figure 3. Gender composition of the 99.9th percentUe: WISC-R versus Raven's Standard 
Progressive Matrices 



100 r 



80 



o 



to 40 



|g Male 
□ Female 




WISC-R R«ven 
Means of Certification 



(50%) 



42 



As Figure 3 shows, the substantial gender imbalance in favor of males with the WISC-R was 
essentially eliminated with the RPM. Chi square analyses statisticaUy demonstrated the reduction in 
gender bias at the highest levels of abUity. Results Site summarized in Tables 8 and 9, where actual 
nvimbers of boys and girls who scored three standard deviations above the mean on the WISC-R and 
the Raven, respectively, were compan^d with the expectation of equal numbers of boys and girls. 



Table 8, Chi Square results comparing boys and girls certified "Seminar" using the 
WISC-R with the expectation of equal gender numbers 



Etlinicity df Total n 

Latino/Hispanic 1 52 

White 1 925 

African-American 1 47 

Asian 1 56 

Native American 1 3 

Pacific Islander 1 2 

Filipino 1 29 

Indochinese 1 8 



Boys (Observed) 

29 

570 

26 

30 

2 

2 

21 

5 



* p < .05 
p < .01 
p < .001 



Chi Square 

0.35 

22.47*** 

0.27 

0.14 

0.17 

1.00 

2.91 

0.25 



Table 9. Chi Square results comparing boys and girls certified "Seminar" using the 
Raven with the expectation of equal gender numbers 



Ethnicity 

Latino/Hispanic 
White 

African-American 
Asian 

Native American 
Pacific slander 
FUipino 
Indochinese 



1 
1 
1 
1 
1 
1 
1 
1 



n (total) 

69 

300 

33 

33 

3 

4 

32 

29 



Boys (Observed) 



34 

156 

18 

15 

1 

3 

32 
15 



Chi Square 

0.007 

0.24 

0.14 

0.14 

1.16 

0.5 

2.30 

0.02 



p<.05 
p<.01 
p<.001 
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The efficacy of the Raven was compared to that of the WISC-R for predicting achievement in a 
series of correlations between intelligence tests and achievement tests used by the school district. Each 
child received a test of intelligence and an achievement test during the same school year. Table 10 
demonstrates correlations between performance on the Raven, as expressed in z-scores, and on the 
subtests of the Comprehensive Test of Basic Skills (CTBS) for 1707 children, as well as correlations 
between CTBS and WISC-R for another 1925. As can be seen, the Raven correlated more highly with 
CTBS Total Language subscores than did the WISC-R for African-American and White children, but 
not for Latinos. In fact, for African-Americans no significant correlations were found between WISC- 
R scores and CTbS Total Language. Raven performance correlated more highly with CTBS Total Math 
scores than did WISC-R only for White children. For all three ethnic backgrounds, WISC-R VIQ scores 
were more highly correlated with CTBS Total Reading scores than were Raven scores. 



Table 10. Correlation(r) of Raven or WISC-A performance with achievement test 
(CTBS) performance for children of three ethnic backgrounds 



Whites 

RAVEN (n = 901) 
VIQ (n = 1566) 
PIQ (n = 1566) 
FSIQ {n = 1566) 

African- Americans 
RAVEN (n = 276) 
VIQ (n = 221) 
PIQ (n = 221) 
FSIQ (n = 221) 

Latinos/Hispanics 
RAVEN (n = 530) 
VIQ (n = 138) 
PIQ (n = 138) 
FSIQ (n = 138) 





Subtest 




CTBSTL 


CTBSTR 


CTBSTM 


.1907** 


.1727** 


.2187* 


.1638** 


.2479** 


.1836** 


.1081** 


.1274** 


.1648** 


.1623** 


.2292** 


.2069** 


.2372** 


.1916** 


.1257* 


.0691 


.2740** 


.1440 


.0184 


.1126 


.1771** 


.0541 


.2406** 


.2002** 


.1563** 


.0797* 


.1327** 


.1899* 


.1365 


.3150** 


-.0476 


- .0196 


.1708* 


.0761 


.0725 


.2718** 



*p < .05 
** p < .01 
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Correlations between the Raven and the Abbreviated Stanford Achievement Test (ASAT) for 
chUdren of six ethnic backgrounds are summarized in Table 11. Correlation coefficients ranged from 
.2235 to .3109, with a median r = .25. Clearly, the Raven correlates more highly with ibe ASAT than with 
the CTBS in all three domains measured (language, reading, and math). In addition. Raven— ASAT 
correlations were higher than WISC-R— CTBS correlations. 

Table 11. Correlation (r) of Raven performance with achievement as measured by the 

Abbreviated Stanford Achievement Test for children of six ethnic backgrounds 
(Raven performance was expressed in Z scores.) 



Ethnicity 

African- American 

Asian 

White 

Filipino 

Indochinese 

Latino 



n 

1581 

305 

4020 

1002 

587 

2528 



Total 

Language 

.2478** 
.2386** 
.2245** 
.2235** 
.2582** 
.2468** 



Total 
Reading 

.2639** 
.2654** 
.2252** 
.2438** 
.2484** 
.2981** 



Total 
Math 

.2695** 
.2634** 
.2494** 
.2429** 
.2584** 
.3109** 



'p< .01 



Discussion 

Our results lead to two clear conclusions. First, considering only children referred for gif tedness 
testing the RPM produces far better equity for all ethnic backgrounds when compared to the WISC-R. 
While the RPM overselected Whites, it did so at a substantially reduced rate (i.e., 120 percent over 
expectation with the Raven vs. 200 percent over expectation with the WISC-R). Moreover, while it did 
not produce complete equity, even when considering such risk factors as low socioeconomic or cultural 
differences the RPM led to substantially increased selection ratios for tiraditionally underrepresented 
groups such as Latino/Hispanics and African-Americans. Moreover, the RPM did lead to an equitable 
selection for Native-Americans, Pacific Islanders, and Indochinese, all of whom had been 
underrepresented with the WISC-R. 

The success of the RPM with Indochinese is of interest in terms of evaluating non-English 
speakiiie children. In the past the district had difficulties evaluating giftedness in Indochinese children 
who spoke Uttle or no English. Since use of the RPM enabled evaluation of abiUty i^dependentiy of 
language it was possible to assess these children with the least bias heretofore achieved. With the RPM 
Indochinese were selected almost exactly in proportion to their numbers in the disti-ict as a whole. 

The advantage of the RPM was not only evident in terms of producing a more equitable 
disti-ibution, but it was more effective than the WISC-R in predicting language achievement for Afncan- 
American and White children, as well as in predicting math achievement for Whites. Although our 
data cannot be used to directly compare the WISC-R with tlie Raven for predicting scores on ti^e ASAT, 
RPM scores were more highly correlated with ASAT scores than with CTBS scores for aU groups. 
However the WISC-R correlated more highly than the RPM with CTBS Total Readmg for all groups 
and with Total Math for African-Americans and Latinos. It would be insfaructive to directly compare 
WISC-R — ASAT and Raven — ASAT correlatiol^s in the same sample to more completely compare 
efficacy for predicting achievement. 

The Raven's abiUty to predict achievement, even language achievement, is due to its high 
correlation with Spearman's (1904, 1927a, 1927b) g factor (see Carpenter et al 1990; Marshalek et al 
1983- Snow et al., 1984). Since tests of language achievement are highly correlated with g, g-sahirated 
tests such as the RPM share common variance with them. Thus, the RPM can have clear advantages for 
measuring abiUties for individuals who speak a language other than English or are from a different 
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culture (Cotirt, 1991). Moreover, since it does not depend on an expUcit knowledge base, as does the 
WISC-R and other verbally weighted standardized tests, the RPM is better suited to traditionally 
underrepresented children. It must be emphasized, however, that the RPM is not simply for 
underrepresented children, as it led to a more equitable distribution across ethnicity, even for Whites. 

Previous studies that have compared the WISC-R and the Raven in the selection of gifted children 
(e.g., James, 1984; Pearce, 1983; Tulkin & Newbiough, 1968; Meeker, 1973; Kier, 1949) have been primarily 
concerned with the correlation between the two measures. Such studies have reported correlations m 
the .70's and have been generally supportive of the Raven as an alternative. Our findings suggest that 
where equity is a concern, the Raven is a far better measure. 

It should be noted, however, that the population of gifted children selected by the Raven is not 
identical to that selected by the WISC-R. The differences are important. First, there is the group of 
children who would be selected by either test. These children tend to be verbaUy advanced, highly 
intelligent, and high in achievement. Second, there is the group of children who would be qualified by 
a WISC-R, but not by a Raven. In our experience, such children tend to be verbally advanced and 
highly motivated. Teachers of academic gifted programs readily accept these children as gifted, especiaUy 
in the early years. Due to the unreUability of IQ scores at the upper IQ levels, especiaUy for the younger 
age levels, the IQ's of many of these verbaUy advanced children show regression to the mean as they 
mature. 

A third group of children are those who would not qualify with the WISC-R, but would with 
the Raven. Such children are of extremely high potential, but may be only average (or even below 
average) in achievement. Others may be of very high ability, but poorly motivated. Our experience has 
revealed that this type of shident is not always accepted by teachers as gifted. Yet, it is tliis very type of 
chUd — the one with raw, undeveloped potential — for whom the present investigation was aimed. For 
programs using the Raven on a widescale basis, teacher training is often needed to help integrate these 
students into the gifted classroom. 

One of the reasons tests like the WISC-R continue vo be used, in spite of their obvious selection 
bias, is that they are reUable and objective. To date, no other approach matches standardized tests m 
terms of reUability, predictive vaUdity, and objectivity. Yet, given the huge selection bias inherent in the 
WISC-R, it is difficult to imagine how it, and others Uke it, can continue to be used in today's Utigious 
environment and heightened awareness of civil rights. A simple examination of Figure 1 would find 
the WISC-R completely unacceptable as a tool to select for giftedness. 

The Raven clearly fared better in terms of equity without sacrificing objectivity, reUability, and 
predictive validity. Nevertheless, it is also clear that the Raven, even when used in conjunction with a 
consideration of risk factors, stUl falls short of producing equity across aU ethnic groups. 

Based on our present findings, and on previous reviews of psychometric tests (Kaplan & 
Saccuzzo, 1993), we are convinced that as traditional tests are presently used, there exists not a smgle 
one that would produce an equitable selection for gifted programs. If the goal is to select chUdren for an 
academic program using an objective, reUable measure with high predictive vaUdity, their traditional 
tests as they are presently used, faU short in terms of equity. The search for multiple inteUigences, as 
suggested by Gardner (1983), is, of course, one viable option. The use of portfoUos and other subjective 
approaches, however, while promising on a smaU scale, faces numerous obstacles in terms of objectivity, 
reUabUity, and predictive vaUdity. . ,.u 

The question remains, are traditional tests beyond redemption? We beUeve that there reniam 
potentiaUy promising options. One such option, suggested by Raven (1989), is the me of loca ethmc 
norms for selection purposes in gifted programs. A second, suggested by Carlson (1989) and CoUeagues 
(Carlson & DiUon, 1978; Carlson & Wiedl, 1978), is to use traditional tests in creative ways. For example, 
using a dynamic testing approach with the Raven, Carlson and Wiedl (1979) were able to eUminate 
Hispanic/ White and Black/ White differences in IQ. Before we abandon what remams the most objective, 
reUable and vaUd approach to selection, it behooves us to determine whether innovative uses can rescue 
traditional assessment procedures, or if they should be abandoned in favor of less psychometncaUy 
sound, but perhaps more equitable, approaches. 
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ABSTRACT 



The Wechsler Intelligence Scale for Children-Revised (WISC-R) is the 
most widely used individual instrument for inclusion or exclusion of children 
into programs for the gifted in the United States. The present study investigated 
the psychometric adequacy of this use of the WISC-R in a population of 8396 
potentially gifted elemental y grade children of diverse ethnic and cultural 
backgrounds as well as diverse emotional and social environments. Shady I 
included analyses of VIQ-PIQ base rates in 5796 children who achieved Full 
Scale IQ (FSIQ) scores of 130 or above, plus comparisons of similarities and 
differences in subsamples divided on ethnic background, on level of risk 
identified in the child's home environment, and on the extremes of achievement 
(measured by a standardized achievement test). In contrast to findings from the 
WISC-R standardization sample, children in this study differed strikingly, across 
ethnic groups and across levels of risk, in shape of the VIQ-PIQ difference 
distribution but not in absolute size of the VIQ-PIQ difference. The frequency 
distributions for African-Americans and Caucasiar\s were skewed in favor of 
VIQ; for Filipinos, in favor of PIQ. Those of Asians and Hispanics closely 
resembled normal distributions. Skewness for children with identified risk was 
in favor of PIQ relative to those without risk. The importance of clinical versus 
statistical significance in decision-making was discussed, with particular 
attention to what constitutes a 'rare' VIQ-PIQ difference in gifted children. Study 
n attempted, through multivariate modeling, to identify either a single model 
or individual models, using subtests of the WISC-R, that would select equally 
accurately from five ethnic groups (African-American, Asian, Caucasian, Filipino, 
and Hispanic). No single model or combination of individual models was found 
to select equally from each of the ethnic backgrounds in a proportionately 
balanced random subsample of 1438. ImpUcations for this use of the WISC-R in 
diverse gifted populations whose characteristics differ from those of the 
standardization sample were discussed, in Ught of the professional ethics of 
responsible test use. 
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I. Introduction 



Currently, identification of giftediiess in school age children is undertaken nationwide with the 
aim of providing special educational services for those with special gifts and talents. HistoricaUy, the 
use of tests to identify individuals with special talent has been recorded as early as 2200 BC in China 
(IDuBois, 1970). In 1869 Galton first addressed the concept of genius in the psychological literature. In 
1925 Teman began the first major study in which giftedness was operationally defined in terms of 
performance on standardized IQ tests. Since these landmark contiibutions, confUct and conbroversy 
have abounded in the educational and psychological Uterature on giftedness. Disagreements continue 
over the definition of giftedness per se, its measurement by the use of IQ and achievement tests, and its 
nurtiirance by special instiiictional programs. This work wiU focus on one aspect of giftedness: ethical 
use of tests in the selection of chUdren from diverse backgrounds for early inclusion in special programs 
for the gifted and talented. 

Identification and inclusion of gifted children from varied culhiral and linguistic backgrounds 
into gifted and talented programs at an early age is vital. As Horowitz and O'Brien (1986) note, "there 
is no way to measure the loss when individuals capable of functioning considerably above the normal 
level do not conti-ibute as much to society as their capabilities will aUow" (p. 1147). The summary of 
findings in an evaluation of the Gifted and Talented Education (GATE) program in San Diego m tiie 
academic year 1989-1990 (MiUett, 1990) included the information that "GATE shidents outperformed 
gifted shidents who are not participating in the program at every grade level" (p. 9). Given the 
demonsti-ated benefits of programs for gifted children, educators face the challenge of early identification 
of children with tiie highest potential for inclusion in enrichment programs. This problem becomes 
more critical in Ught of mandates that educational programs strive to guarantee equal access and yet 
operate within a framework of increasingly restirictive educational budgets. 

Problems in the Definition of Giftedness 

Currently in this country most efforts to identify giftedness in chUdren utili7,e a definition based 
on intelligence, measured by some form of standardized group or individual IQ test. The ti-end began 
to be formalized in 1971, when the first definition of gifted and talented children was proposed on a 
national level (Pub. L. 91-230, § 806): 

Gifted and talented chUdren are those identified by professionaUy qualified persons who 
by virtiie of outstanding abiUties, are capable of high performance. These are children who 
require differentiated educational programs and/or services beyond those normaUy provided 
by the regular school programs in order to realize their contiribution to self and society. 

ChUdren capable of high performance include those with demonsh'ated achievement and/ 
or potential ability in any of the f oUowing areas, singly or in combinations: (1) general inteUectual 
abUity, (2) specific academic aptitiide, (3) creative or productive thinking, (4) leadership ability, 
(5) visual and performing arts, (6) psychomotor ability. 

Seven years later, 42 states had either enacted laws or formulated guidelines for the definition of 
giftedness. In aU 42 states, including California and New York, general inteUectiial abUity was specified 
(Fox, 1981). 

The issue of the definition of the natiire of "intelligence," thought to underlie inteUechial abQitj' 
and academic performance, has probably been debated as much as any other in the history of the 
psychological Uterahire. Binet (in Terman, 1916) defined intelligence as "the capacity to make adaptations 
L the piSpose of attaining a desired end" (p. 45). Spearman (1923) wrote that mteUect mvolves educmg 
either relations or correlates" (p. 300), and proposed a two-factor theory; g was defmed as '^derlym^ 
general mental energy, whereas s represented one or more specific factors. Wechsler (1958) espoused 
ttie definition: "the aggregate or global capacity of the individual to act purposefu ly, to thmk rationally 
Tnd to'eal effectivf with his environment" (p. 7). However, Thomdike (1927 theorized *at 
intelUgence involves interconnected but distiUCl abiUties and so advocated a mullifactor approach. 



Guilford (1%7) developed a multif actor theory of intelligence based on three dimensions— the operations 
involved in information processing, the contents, and the products. In contrast, Vernon's (1950) was a 
hierarchical theory of intelligence based on the hypothesis that g is at the highest level of the hierarchy 
and represents the broadest aspect. 

More recently, Sternberg (1986) developed a theory that divides intelligence into three 
dimensions. Gardner (1983), on the other hand, suggested that there are several distinct and relatively 
separate competencies, which he described as multiple inteUigences. The debate continues, with some 
theorists espousing models based on an underlying basic mental capacity and others favormg a set of 
distinct and relatively discrete mental abilities. 

Issues in the Assessment of Intellectual Giftedness 

In acknowledging that there are many definitions of what constitutes intelligence, we must 
also acknowledge that there are many tests that purport to measure it. At the present time, however, 
the single instrument most frequently used for identification of giftedness in children m the Umted 
States is the Wechsler Intelligence Scale for Children - Revised (WISC-R) (Klausmeier, Mishra, & Maker, 
1987). The WISC-R has been widely acknowledged to have excellent reliability and concurrent validity 
(Sattler 1988) The current study examined the characteristics, efficacy, and fairness of this particular 
use of the WISC-R in one large school district (San Diego City Schools) over a seven-year span of time. 

The San Diego City School District is among thie most culturally diverse in the nation. The 
1991-92 student population of 123,503 included 35.4% Caucasian, 28.8% Hispanic, 16.3% African 
American, and 8.1% Filipino children. The remaining 11.4% consisted of Indochinese, Asian, Pacific 
Islander, ^d Native American students. Programs for gifted and talented students, begun in ttie 
district in the 1940's, have demonstrated an on-going commitment to achieving equal access for 
individuals of all ethnic backgrounds through the use of selection instruments more likely to identify 
giftedness in culturally and linguistically different students. Despite these attempts, the non-Caucasian 
student population in gifted programs was 36.3% in 1989-90, as opposed to 61% in the school district as 
a whole Underrepresented groups included Hispanics and African Americans; overrepresented were 
Asian, Filipino, ar.d non-Hispanic white students (Millett, 1990). Richert (1987) cited figures published 
by the U.S. Department of Education's Office of CivU Rights revealing that groups such as Hispamcs 
and African-Americans are underrepresented by as much as 70% in gifted programs throughout this 
nation Thus the underselection of these two groups in San Diego reflects a nationwide problem. The 
National Report on Identification for Gifted and Talented Youth (Richert, Alvino, & McDonnel, 1982) 
noted problems with braditional selection procedures. Indeed, today most authorities beUeve that 
especiaUy for disadvantaged groups braditional standardized tests should not be tiie sole or even tiie 
primary measure of giftedness (Fox, 1981; Garcia, 1981; Horowitz & O'Brien, 1986; McKenzie, 1986; 
Meeker & Meeker, 1973; RenzuUi, 1978; Sternberg, 1981). 

The American Educational Research Association, The American Psychological Association, 
and the National CouncU on Measurement in Education take the position tiiat "In elementary or 
secondary education, a decision or characterization that will have a major impact on a test taker should 
not automatically be made on the basis of a single test score." (Standard 8.2, Standards for Educational 
and Psychological Testing, 1985). Although many authorities do recommend the use of multiple 
identification procedures such as IQ, achievement, and behavioral data in the identification of giftedness 
in practice much emphasis is comn^only placed on a single measure of achievement or of overaU 
intelligence (Alvino, McDonnel, & Richert, 1981). Zigler and Farber (1985) stated that a specif ic defined 
level of IQ (such as a score two standard deviations above the mean) is the most adequate mdex of 
giftedness. Pegnato and Birch (1959), Clark (1979), and Hagen (1980) recommended use of an 
k^dividually administered IQ test as the best and the quickest way to find mosi gifted children. Sattler 
(1988) concluded that "the single best method avaUable for the identification of children with supenor 
cognitive abilities is a standardized, individually administered test of intelligence..." (p. 671), but went 
on to note that among those who are difficult to identify as gifted are childreri who are culUirally 
different especially since they may not show superior oral language skills. Indeed, as was so weU 
expressed by the Standards for Educational and Psychological Testing (1985), "Achild from one cultiire 
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who is evaluated with mores appropriate to another culture may be considered taciturn, withdrawn, or 
of low mental ability." 

Methodological Issues in WISC-R Testing of Ethnic Groups 

Use of standardized intelligence test summary scores without ethnic, cultural, gender, economic, 
and other considerations is based on a uniformity assumption: that aU students, aU testers, and aU 
situations are homogeneous. The fallacies inherent in this assumption in the use of standardized tests 
have been repeatedly noted (Guertin, Ladd, Frank, Rabin, & Hiesler, 1971; Lewandowski & Saccuzzo, 
1976) Unforhmately, most standardized tests have only a single set of norms that have not been corrected 
for the demographic characteristics of the individual. The WISC-R, for example, yields scores corrected 
only for chronologic age. It has long been recognized that the influence of demographic vanables m 
tests of brain function is apparent for individuals (Finlayson, Johnson, & Reitan, 1977; Reitan, 1955). For 
example recent cross-sectional studies of the Wechsler tests for adults indicate that a smgle set of norms 
cannot be used for individuals at different age and education levels (Heaton, Grant, & Matthews, 1986). 

Further the use of a single summary score may mask differences in the pattern of strengths 
across ethnic backgrounds and gender in gifted chUdien. Lesser, Fifer, and Clark (1965) reported results 
of a comparison of mental abiUties in seven and eight year old first grade chUdren from four ethnic 
groups and two socioeconomic levels in New York. Individuals of African-American, Chmese, Jewish 
and Puerto Rican background were compared on four basic dimensions of mental abiUty usmg a modified 
version of the Hunter CoUege Aptitude Scales for gifted children. The chUdren were found to differ in 
pattern of mental abiUties across ethnic background but not across socioeconomic status. Lesser et al. 
proposed that identification of the pattern of relative strengths and weaknesses of children from varied 
cultural backgrounds was a vital prerequisite to decisions about education in general and cumculum m 
particular. 

Methodological Issues in Quasi-Experimental Assessment Studies 

In reviewing the Uterature on the use of tests with different ethnic groups, several methodoUgical 
issues become apparent. Some are inherent in the nature of quasi-experimental and archival design 
(e g the impossibility of random assignment to groups on key factors such as ethnic background or 
socioeconoinic status), and limit the generalizabUity and appUcabiUty of the studies. Others resultfrom 
a faUure to control for moderator variables such as socioeconomic status and acculhiration, or from a 
failure to use multiple methods within the same study Several of these points wiU be iUustrated m the 
following examination of studies subsequent to Lesser, et al. 

Since pubUcation of Lesser et al.'s findings, numerous investigators have made attempts to 
confirm differences in pattern of mental abiUties across ethnic groups (e.g., Flaugher & Rock, 1972; 
Hennessv & Merrifield, 1976; Sitkei & Meyers, 1%9). None of the subsequent studies have used ttie 
same teste, the same ethnic groups, or even children of the same ages. Most did not control for level of 
abiUtv and no single study looked at aU of these confounds systematically. Despite these flaws there 
has been a tendency among reviewers (e. g., Sattler; 1988) to characterize these attempts as faUure to 
repUcate" the findings of Lesser et al. Indeed, Sattler cited one study of 4 year olds (Sitkei & Meyers 
1969) oneof junior Wghschoolstudente(Raugher&Rock,1972),and one ofWghsch(X)lseniors accepted 
for admission to a major university (Hennessy & Merrifield, 1976) as evidence of failure to rephcate. 
•niese studies were factor analytic in nature. Whereas the original work by I.^sser et al. was based on an 
analysis of covariance method of comparing mean scores on teste across groups, the shidies cited by 
Sattler, as well as elsewhere in the Uterature, compared factor structure of a given test across groups, 
and some included a comparison of faciOT means across the groups. 

Several issues in quasi-experimental design and methodology become apparent when such 
studies are compared as "repUcati.ons": 

1 It is not vaUd to compare resulte of studies with populations of preschool children, elementary 
school children, junior high school shidents, and high school seniors. The increased exposure to 
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environments outside the cultural environment of the home as the child progresses through school is, 
for example, an enormous confound and provides a valid alternative hypothesis for the different findings. 

2. Different assessment batteries can produce different results. Sitkei and Meyers (1969) used 
an extensive battery that included the Peabody Picture Vocabulary Test, which, as the authors 
acknowledged, is much less highly verbally loaded than the measures used by Lesser et al.. In fact, 
Sitkei and Meyers offered this lower verbal demand as one possible alternative hypothesis for the 
difference in their findings from those of Lesser et al. 

3. Results at one level of intelligence do not necessarily generalize to others. Lesser et al. 
studied children matched on the basis of social class, gender, and ethnic membership; each of those 
matching variables has been correlated with differences in performance on tests of mental ability. 
Hennessy and Merrifield's (1976) subject pool was restricted to high school seniors who had been 
accepted for admission to universities in the fall. It seems unlikely that the two populations were 
comparable in their basic levels of mental abiUty, although Hennessy and Merrifield were carehil to 
partial out the effects of socioeconomic status. 

4. An analysis of covariance, directly comparing group means on subtest scores, provides 
different information than a factor analytic comparison, including a comparison of the factor means. 
Factor analysis is a data reduction technique for mathematically analyzing the intercorrelations between 
members of a set of variables and thus deducing a smaUer set of factors. Those factors are assumed to 
account for the intercorrelations seen in the directly measurable original variables. The factors are 
arbitrarily named and interpreted (hopehilly based on a theoretical model of the construct being studied); 
comparing factor means is wt the same thing as comparing observable test score mean differences. A 
test could measure the same underlying mental abilities in four groups and yet produce a ve y different 
pattern of strengths and weaknesses in subtest performance across those four groups. In other words, 
it is possible that the groups show the same pattern of intercorrelations between subtests, but differ in 
the level of their original mean scores on subtests that critically load on a given factor. Group A could 
have consistently lower scores than Group B on all measures loading on Factor 1, and still show the 
same overall pattern of intercorrelations between those tests. 

5. A dLfference in group means does not imply that most individuals in a group will have 
scores that faU in the direction of the observed group mean. Methodological rigor demands analysis of 
not only group means, but also individual data in conjunction with the group data. As Guertin, Frank, 
and Rabin (1956) point out: "One methodological shortcoming is the failure to distinguish between a 
mean diagnostic group profile and modal patterns of homogeneous subjects ... Only modal patterns 
are appropriate for diagnostic purposes" (p.239). For example, in a shidy of the WISC as a clinical 
diagnostic tool, Saccuzzo and Lewandowski (1976) found group differences on one subtest (Pichire 
Arrangement) that would indicate that a preponderance of the individual scores could be expected to 
fall above the mean in the higher group. When individual cases were examined, however, it was found 
that less than half of the cases achially were above the mean, and there were no consistent tendencies 
on this subtest. Therefore, the subtest could not be used as a clinical indicator. In another example, 
these investigators found no group differences between the races in terms of WISC responses on a 
number of Wechsler's hypotheses regarding acting-out adolescents. On post-hoc analyses, however, 
there were clear differences between white males and black females that were masked by the overall 
means. If the issue is one of fairness of selection criteria, then individual scores must be exammed m 
light of group means. 

Again the basic issue in the use of any test to select students for special programs is one of test 
use- fairness demands that the test be used in a way that will select equally from various groups, rather 
tha^i invariably favoring (or disfavoring) members of one group over another. It Ls certaiiUy possi^^^^^ 
design a test that appears to measure the same underlying constructs across groups, and still fmd that 
the test differentially selects members of one gi-oup over another because of the way it is being used. 
That may be the case with the common practice of using the WISC-R to identify intellechial giftedness 
in children. 
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The San Diego City Schools Studies 

Preliminary studies with a San Diego gifted population using a group measure of intelUgence, 
the Developing Cognitive Abilities Test, indicated that predictors of giftedness depend on ethnic 
background (Saccuzzo, Hermanson, Dome, Johnson, & Shamieh, 1990). For African-Amencans, U^e 
quantitative score proved most predictive, while for Hispanics the spatial and total scores were most 
predictive of selection for gifted programs. Total scores alone were most predictive for only the 
Caucasians and FiUpinos, who were overrepresented in the gifted and talented program. These f mdmgs 
suggested that giftedness may be expressed in imique patterns of abiUties not best measured by a 
summary IQ score. Althougli the study was not (and was not intended to be) a repUcation of Lesser et 
al 's work, the results did add weight to the idea that identifiable differences exist in the way giftedness 
is expressed across ethnic and cultural backgrounds. In hirther support of this hypothesis, a summary 
of academic performance of aU students in gifted programs in San Diego City Schools mdicated that 
Hispanic and African American students at aU grade levels generaUy faU below other groups (and 
below the 90th percentile) only in reading and language (Millett, 1990). Analysis of VIQ - PIQ 
discrepancies in a random subset of this population also revealed differences that vaned across ethnic 
background and as a function of the size of the discrepancy (Saccuzzo, Johnson, & RusseU, 1992). 

Given that the WISC-R is one of the single most widely used instruments for the identification 
of giftedness in the United States, and given the problem of underselection of certain ethnic groups, the 
goal of this study was to examine the feasibihty of using the WISC-R in any way to select a balanced 
population of gifted children,since it would appear that a summary IQ score wiU not do so. The present 
work began with an analysis (Study I) of the WISC-R Verbal, Performance, and Full Scale scores of 
chUdren who achieved a Full Scale IQ score at least two standard deviations above the mean (FSIQ > 
130) Children were compared and contrasted in terms of basic demographic factors such as ethnicity 
and gender, as weU as on environmental factors thought to place them at risk for limited expression of 
their hiU potential (e.g., economic, language, and emotional factors). Verbal-Performance differences 
were examined in a study of base rates for the entire sample of inteUectually gifted children as weU as 
for subsamples defined on the basis of ethnic background, areas of risk, and documented low or high 
school achievement test scores. In spite of exceUent discussions by Kaufman (1976) and Matarazzo and 
Herman (1984) on the difference between statistical and clinical significance, Uttle has been documented 
about the relative rarity of specific VIQ-PIQ discrepancies in different populations of children. Fmdmg 
that a child has a statisticaUy significant VIQ-PIQ difference tells the clinician or educator nothing more 
than that the difference is probably real and not due to chance: it does not address the issue of the ranty 
of that difference in a given population or of its real world significance, nor does it address the lixehhood 
that such a VIQ-PIQ difference is associated with low achievement. Only by studying actiial occurrence 
in a population can we address such issues. Kaufman (1976) noted no differences in base rates across 
ethnic backgrounds in children with IQ values of at least 120 in the standardization sample. Two 
serious problems with that finding are that Kaufman did not take into account the durection of the 
difference (only the size), and that there were almost certainly not enough non-Caucasian children m 
the sample at those IQ levels to have found a difference even if it existed: the total number at that IQ 
level was 213. The present shidy was undertaken to provide accurate base rates for a large, cultiirally 
diverse sample of gifted children, with the hope that more evidence could be provided to dispel mvaUd 
uniformity assumptions and to shed light on this gifted population. 

Study II examined the feasibility of deriving a single set of criteria from the WISC-R to select a 
proportionately representative, ethnically diverse sample of children for inclusion in programs for the 
gifted by exploring two alternative hypotheses: (1) there exists a single pattern of WISC-R subtest scores 
tiiat predicts giftedness equaUy across gender and ethnic background; or (2) there is a unique pattern of 
cogiitive strengths and thus different predictors of giftedness for each group. Ethnic backgrounds 
represented included African-American, Asian, Caucasian, FiUpino, and Hispanic. 

General Considerations in the Use of Tests for Giftedness 

Again the basic issue is one of competent test use. Despite ongoing discussion, 
ackn.owledgment of the limitations of IQ tests, and exhortations to use these tests in an mf ormed maimer 
(Borland 1986; Kaufman & Harrison, 1986; Robinson & Chamrad, 1986; Sternberg, 1982), no smgle 
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study to date in the educational or psychological literat-are has directly and adequately addressed the 
issue of fairness of the use of this test in an ethnicaUy diverse population of potentiaUy gifted children. 
Goals of this shady included possible explication of a more fair and adequate use of the WTSC-R in 
identification of giftedness, a discussion of selection bias that results from its use, and further 
understanding of tho limits as weU as the full potential of the WISC-R in the selection of shidents from 
diverse backgrotmds for gifted programs in schools. 

Benefits to be gained from improved methods of selection are substantial. If we are to increase 
the number of underrepresented minorities in the professions, as morally and legaUy mandated, it is 
vital to identify and encourage those individuals as early as possible. What are the consequences if we 
continue to faU in this endeavor? They are perhaps best summed by D. D'Souza (1991) in his description 
of the experience of one university noted for its aggressive affirmative action policy: 

...the academic difficulties encountered by affirmative action students who find it impossible 
to compete effectively with other, better-prepared students, are reflected in Berkeley's exb^mely 
high dropout rate for Hispanic and black undergraduates. Whites and Asians graduate from 
Berkeley at about the same rate: 65-75 percent. That is to say that 25-35 percent drop out before 
graduation. Hispanics graduate at under 50 percent. More than half drop out. Blacks graduate 
at imder 40 percent. More than 60 percent drop out. 

...Berkeley does not release the number of blacks and Hispanics admitted on affirmative 
action who drop out, but these data are contained in a confidential internal report which ta'acks 
freshmen enroUed in 1982. By 1987, five years later, only 18 percent of blacks admitted on 
affirmative action had graduated from Berkeley; blacks admitted in the regular program 
graduated at a 42 percent rate. Similarly o. Jy 22 percent of affirmative action Hispanics finished 
in five years, compared with 55 percent for other Hispanics. The most recent figures suggest 
that approximately 30 percent of black and Hispanic shidents drop out before the end of their 
freshman year; in the words of the report, they seem to stay "only long enough to enhance the 
admissions statistics." (p. 39) 

I would propose that the key phrase is "better prepared" shidents and suggest that such preparation 
must begin as etirly in elementary education as possible. 

Inclusion of more equitable proportions of high risk children in gifted programs is a goal much 
sought in education. A unique opporhanity exists in San Diego to shidy selection procedures for gifted 
and talented programs: a large, ethnically diverse meta'opolitan population plus a school dista'ict that 
continues to demonstrate its commitinent to identification of underrepresented and disadvantaged 
students. 



II. Methods 

Two shidies were completed: I) an analysis of the base rates of VIQ-PIQ differences in a 
population of inteUechiaUy gifted chUdren, defined as those who achieve a FuU Scale IQ score at least 
two standard deviations above the mean, and H) an examination of the use of the WISC-R to select a 
proportionately representative and ethnicaUy diverse sample of gifted children hrom the population of 
chUdren identified as potentially gifted. 



Subjects 



Each child in this shidy was identified as potentiaUy gifted based on achievement test data, 
teacher evaluation (Appendix A) and recommendation, and a social case shidy analysis (Appendices B 
and C). The social case shidy analysis included an assessment of 6 areas of potential risk for achievement 
and expression of hiU potential: 1) culhiral, 2) economic, 3) emotional, 4) environmental, 5) health, and 
6) language Cultural risk included culhiral values and beUefs that differ from the those of the dommant 
culhire or Umited experience in the dominant culhire. Economic risk included parental unemployment 
or household income low enough to quaUfy the chUd for the tree lunch program. Emotional risk 
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encompassed such factors as death of a parent, child abuse, major psychiatric illness in the home, or 
extended absence of a parent due to military service. Environmental risk included transiency (three or 
more school moves) and excessive absences from school due to home responsibilities such as child care 
responsibility or working to help support the family. Health factors included vision, speech, or hearing 
deficits requiring designated instructional service, motor problems requiring adaptive physical 
education, or diseases such as asthma. Children at risk due to language included those for whom 
English is a second language and those not fluent in English, For the purposes of the current project, 
each child was assigned a value for level of risk: 0 if no identified risk, 1 if risk was identified in one and 
only one of the areas described above, and >1 if more than one area of risk was identified for that child. 

Ethnic backgroimd was determined by self-report, based on an information questionnaire 
completed by parents at the time of their child's enrollment in the school district. Problems are inherent 
in such self-report, including the resultant heterogeneity of each group. For example, the chUd of one 
Caucasian and one Hispanic parent may be reported to be Caucasian or to be Hispanic, depending on 
societal factors that enter into the parents' decision to report: Being considered Caucasian might seem 
to confer some obvious dominant culture benefits, but being designated Hispanic might open 
opportunities for scholarship or for special tutorial programs in a given school. 

Ethnic categories designated by this district are broad and in themselves create heterogenious 
groups: 'Hispanic' includes those from Mexico, Cental and South America, Puerto Rico, Cuba, and 
Spain. Children from those different ethnic and cultural backgroimds may be more dissimilar among 
themselves than they are from children of other ethnic categories such as African-Americans or 
Caucasians. 

Under the selection model used by this school district, each of the children to be certified 
gifted must have achieved a scoi-e on a nationally standardized group achievement test in the 90th 
percentile or higher. Since not every child is referred for evaluation, several sources of referral bias may 
begin at this stage of the process (e.g., based on gender, culture, or verbal skill level). Each then was 
further evaluated with a nationally standardized individual test of intelligence. Children were 
subsequently certified gifted in one of two ways: 1) an IQ score two standard deviations above the 
national mean or higher (e.g., WISC-R FSIQ > 130), or 2) an individual IQ score > 120 plus the presence 
of two or more identified areas of risk, as discussed above. An examination of the risk factors 
demonstrated coi\siderable heterogeneity within each ethruc group and across ethnic groups, as would 
be expected (see Figure 1). Again, problems in the use of self-report data can be seen. Certain groups 
may tend to under-report, and teachers may tend to selectively report factors seen more frequently in 
one group than in another (e.g., language, which is especially obvious without much depth of knowledge 
about the child or family). 

Figure 2, Within each ethnic group, the percentage at each level of risk in the population of children 
referred as potentially gifted. 
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Each chUd in this study was given the Wechsler IntelUgence Scale for Children-Revised (WISC-R) 
by a school psychologist as part of the evaluation process between 1984 and 1991. The two supplemental 
subtests (Mazes and Coding) were not routinely administered in tiiis district, and tiie Comprehension 
and Digit Span subtests were given to too few of ihe children to be included in multivariate analyses. 
The omission of Comprehension and Digit Span for so many children introduced another possible 
source of bias, in that prorated IQ scores were used for those children and may not represent the same 
FuU Scale score as would have resulted from ihe inclusion of aU subtests. Furthermore, tiie decision to 
administer those two sxibtests to some but not all children may have been based on systematic differences 
in attributes such as verbal facility and/or cultural and language differences. 

Study I For tfie study of observed base rates of VIQ-PIQ differences, ihe sample included every 
African-American, Asian, Caucasian, Filipino, and Hispanic child who achieved a Full Scale IQ score 
of at least 130 (by definition, two standard deviations above ihe mean) on the WISC-R between the 
years 1984 and 1991, inclusive. Forty sbc percent were female. Ethnic composition of the sample is 
summarized in Table 1. 



Table 1. Composition of the VIQ-PIQ base rate sample 



Percent of each group 
at each Level of Risk 



Group 


Number 


0 


1 


>1 


African-American 


252 


52 


20 


28 


Asian 


202 


53 


20 


27 


Caucasian 


4895 


71 


16 


12 


Filipino 


182 


40 


20 


40 


Hispanic 


265 


49 


14 


37 


Total 


5796 









Heterogeneity of levels of risk for these children, within and across ethnic groups, can be seen 
in Figure 2. Comparison of Figures 1 and 2 strikingly demonstrates the WISC-R disadvantage associated 
with a high risk environment, for children of every ethnic background. In each ethnic group, children 
from high risk environments differentiaUy tended to score below 130 in FSIQ on the WISC-R aivd so 
were selected out of the sample for the base rate study. Use of a single cut-off score by a school district 
would obviously tend to exclude those children from enrichment programs as well. 

Figure 2. Within each ethnic group, the percent at each level of risk in the population of 
children with FSIQ at least 130. 
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study 11 19,826 children were identified as potentially gifted by the San Oiego City School District in 
the years from 1984 through 1991. A total of 8396 children were subsequently administered the WISC- 
R, while others were evaluated with other instruments such as the Kaufman Assessment Battery for 
Children (Kaufman & Kaufman, 1983). From the group administered the WISC-R, a random sample 
of 1438 (713 female) was chosen to be ethnically proportionate to the composition of the district 
population in the academic year 1990-1991 (see Figure 3). The random sample consisted of 258 African- 
American, 36 Asian, 560 Caucasian, 128 Filipino, and 456 Hisparuc children. Size of the sample was 
limited by the proportionately small number of Hispanic children administered the WISC-R, as 
compared to their numbers in the district population. In its determination to find equitable selection 
methods, this school district uses tests other than the WISC-R whenever possible with the predominantly 
Spanish-speaking members of its large population of Hispanic children. 



Figure 3. San Diego Unified School District ethnic composition, 1991/1992. 
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m. Results 

Study h Base Rates for VIQ-PIQ Differences 

Descriptive statistics for the WISC-R scores of the sample of 5796 children with FuU Scale IQ 
values of at least 130 are presented in Table 2. 

Table 2. Verbal Performance, and Full Scale Scores as a Function of Ethnic Group 



Group 


Mean 


{sd) 


no 

Mean 




ESIQ 
Mean 


(sd) 


African- 










136.7 


(6.09) 


American 


136.2 


(8.02) 


129.6 


(9.00) 


Asian 


135.0 


(9.97) 


136.6 


(8.78) 


139.8 


(7.18) 


Caucasian 


136.4 


(8.55) 


132.3 


(9.08) 


138.4 


(6.56) 


Filipino 


132.3 


(9.63) 


134.5 


(8.92) 


137.3 


(6.18) 


Hispanic 


135.2 


(8.89) 


133.6 


(7.95) 


138.4 


(6.33) 



Preliminary analyses were conducted to examine the trends in this group of inteUectuaUy gifted 
children. Gender effects were analyzed in a 2 (Gender) by 3 (Test Score) mixed repeated measures 
analysis of variance. Significant main effects were found for Gender, F(l, 5794) = 53.67, p < .001, but 
there was no interaction effect. Boys, on the average, scored higher than girls, as can be seen in Table 3. 
Given the standard error of measurement of the WISC-R, although the differences were statishcaUy 
significant, they were clinically irrelevant. 



Table 3. Verbal, Performance, and Full Scale Scores as a Function of Gender 



VIQ PIO ESIQ 

Gender Mean (sd) Mean {sd) Mean (sd) 



Female 135.5 (8.39) 131.9 (8.83) 137.6 (6.24) 

Male 136.8 (8.85) 133.0 (9.25) 138.9 (6.77) 



Verbal Performance, and FuU Scale IQ values for each etimic group were analyzed in a 5 
fEthnicitv) X 3 (Test Score) mixed repeated measures analysis of variance. Results revealed significant 
mreJSts for Ethnicity! F(4, 5791) = 7.41, p < .001 and for Test Score F(2, 5791) = 200.54 P <^001. 
-mese main effects must, however, be interpreted in Ught of the sigmficanl Ethnicity by Test Score 
interaction F(8 5791) = 25.70, p < .001. As can be seen in Table 2 and confirmed m post hoc multiple 
Scheffe comparisons, FiUpino children were significantly lower in Verbal IQ scores than African- 
American, Caucasian, or Hispanic chUdren. On the other hand, FiUpinos were higher m Performance 
IQ than African-Americans or Caucasians, and African-Americans were lower thai^ any other group. 
Clear differences in pattern of strengths and weaknesses among these gifted chUdren seem apparent. 
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To investigate the possibility that observed ethnic group differences in Verbal IQ scores could 
be due primarily to differences in risk status, a oneway analysis of covariance was performed with 
level of risk as the covariate. Results indicated that level of risk was a nonsignificant covariate, and that 
ethnic status remained a significant effect, F(4,5790) = 1L50, p < .001, regardless of risk. 

In a series of oneway analyses of variance, level of identified risk was foimd to have a significant 
effect only on Verbal IQ scores, F(2, 5793) = 9.46, p < .001, but not on Performance or Full Scale IQ 
scores. Post hoc Scheffe comparisons revealed that those with one and only one identified area of risk 
obtained Verbal scores significantly higher than those with no risk, whose scores were higher than 
those with more than one risk area (see Figure 4). 



Figure 4. WISC-R scores as a function of level of risk. 
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Thus the presence of multiple areas of risk or hardship in a gifted child's environment appears to be 
associated with lower performance on 'he Verbal Scale of the Wechsler, while the presence of one 
unspecified risk factor alone does not. 

In an effort to understand the finding that children with one and only one nsk had higher 
mean VIQ than those at no risk, a series of hypotheses was tested. The first hypothesis was that, 
among children with a single identified risk, either ethnic groups with higher mean VIQ (i.e., Asians 
and cfaucasians) or males (who had higher VIQ than females) were disproportionately highly 
represented. A oneway analysis of variance compared VIQ in the two levels of risk, with ethnic group 
membership and gender as covariates. Ethnicity was a significant covariate, F(l,4892) = 9.39, P < -Ol- as 
was gender, F(l, 4892) = 27.45, p < .001. Risk level, however, remained a significant mam effect, F(l,4892) 
= 6 87 p < 01 Therefore, the VIQ differences across level of risk did not appear to be a simple function 
of eth^^city or gender alone. In fact, a 2 (Risk Level) X 2 (Gender) X 5 (Ethnicity) ANOVA demonstrated 
significant maSi effects for Level of Risk, F(l,4876) = 7.27, p < .01, Gender, F(l,4876) = 27.78 p < .001, and 
Edxnicitv F(l 4876) = 6.43, p < .001. None of the interaction effects were significant. Smce neither 
Render nor ethnic background accounted for the differences in VIQ across risk, an alternative hypothesis 
tiiat type of risk accounted for the higher mean in one-risk children was investigated m a oneway 
analyst of covariance with type of risk as the covariate. Type of risk was a significant covariate 
F(l 4893) = 9 24 p < .01. When the variance accounted for by type of risk was removed, level of nsk 
was no longer a'significant effect. To fiirther elucidate this finding, firequencies of ethnic background 
and gender across type of riisk were examined in children with only one area of nsk. Most frequent was 
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emotional risk (30.5% of the total), followed by health (26.1% of the total) and environmental (19.0% of 
the total) The presence of cultural, economic, or language hardship alone was relatively rare (2.2%, 
10 1% and 11.7%, respectively). Within the group at emotional risk, 89.8% of the chUdren were Asian 
and Caucasian; 53.0% were male. Among those at health risk, 96.0% were Asian and Caucasian; 64.9% 
were male. In the environmental risk group, 89.1% were Asian and Caucasian; 56.8% were male Type 
of risk appears to be a mediator for ethnicity and for gender, and the higher mean VIQ scores m children 
with only one risk area appear to be explainable in terms of a higher proportion of males and of Asians 
and Caucasians (all associated with higher mean VIQ) in the group of chUdren identified with only 
emotional, health, or environmental risk than in the overall sample. 

Base rates for the difference hetweecx Verbal and Performance IQ score were obtained and are 
summarized in Tables 4 through 8. Ranges were defined to be consistent with those of Matarazzo and 
Herman (1984), so that comparisons with their findings could be made. 

Table 4. African-Americans: Cumulative Percentage Distribu<:ions of the Difference 
Between WISC-R VIQ and PIQ 



Size of the 
Difference 
Between 
VIQ and PIQ 



%V>P 
(+ Difference) 



Sum of WISC-R 
%P>V + and - Cumulative 

(-Difference) Differences Percentage 



30 and above 


3.97 


0 


3.97 


26-29 


3.57 


0 


3.57 


22-25 


5.56 


1.98 


7.54 


19-21 


5.16 


.79 


5.95 


16-18 


8.33 


1.19 


9.52 


13-15 


6.35 


1.19 


7.54 


10-12 


5.56 


3.17 


8.73 


7-9 


9.92 


4.37 


14.29 


4-6 


6.75 


7.94 


14.69 


1-3 


11 .11 


9.92 


21.03 


0 









100.00 
96.03 
92.46 
84.92 
78.97 
69.45 
61.91 
53.18 
38.89 
24.20 
3.17 



Table 5. /Asians: Cumulative Percentage Distributions of the Difference Between WISC-. 
VIQ and PIQ 



Size of the 
Difference 
Between 
VIQ and PIQ 



%V>P 
(+ Difference) 



30 and above 
26-29 
22-25 
19-21 
16-18 
13-15 
10-12 

7-9 

4-6 

1-3 
0 



0 
.99 
2.48 
2.48 
3.47 
4.46 
5.94 
5.94 
9.9 
8.91 



Sum of WISC-R 
%P>V + and - Cumulative 

(-Difference) Differences Percentage 



.99 
1.49 
3.% 
2.97 
4.95 
4.46 
7.43 
10.40 
8.91 
6.44 



.99 
2.48 
6.44 
5.45 
8.42 
8.92 
13.37 
16.34 
18.31 
15.35 



100.00 
99.05 
96.57 
90.13 
84.68 
76.26 
67.34 
53.97 
37.63 
18.82 
3.47 



CO 



Table 6. Caucasians: Cumulative Percentage Distributions of the Difference 

Between mSC-RVlQandPlQ 



^i7p of thp 
"P^i ff p rpn TP 

LiT^ (. TV V^V^X I 

VTO and PIO 


%V>P 
(+ Difference) 


%P>V 
(-Difference) 


Sum of WISC-R 
+ and - Cumulative 
Differences Percentage 


30 and above 


2.35 


.16 


2.51 


100.00 


26-29 


2.25 


.57 


2.82 


97.51 


22-25 


3.49 


1.41 


4.90 


94.69 


19-21 


5.05 


1.53 


6.58 


89.79 


16-18 


5.99 


2.49 


8.48 


83.21 


13-15 


6.07 


2.80 


8.87 


74.73 


10-12 


8.09 


5.03 


13.12 


65.86 


7-9 


8.83 


5.78 


14.61 


52.74 


4-6 


8.95 


7.84 


16.79 


38.13 


1-3 


9.85 


8.34 


18.19 


21.34 


0 








3.15 



Table 7. Filipinos: Cumulative Percentage Distributions of the Difference Between 
mSC-RVlQand PIQ 



Size of th.e 
Difference 
Between 
VIQ and PIQ 



%V>P 
(+ Difference) 



Sum of WISC-R 
%P>V + and - Cumulative 

(-Difference) Differences Percentage 



30 and above 
26-29 
22-25 
19-21 
16-18 
13-15 
10-12 

7-9 

4-6 

1-3 
0 



I. 65 

0 

2.20 
3.85 
2.75 
5.49 
5.49 
4.40 

II. 00 
6.04 



1.65 
1.65 
5.49 

3.85 
9.89 
6.04 
3.85 
5.49 
9.34 
6.59 



3.30 

I. 65 
7.69 
7.70 

12.64 

II. 53 
9.34 
9.89 

20.34 
12.63 



100.00 
96.70 
95.06 
87.37 
79.67 
67.03 
55.50 
46.16 
36.27 
15.93 
3.30 



Table 8. Latinos/Hispanics: Cumulative Percentage Distributions of the Difference 
Between WISC-R VIQ and PIQ 



Size of the 

Difference Sum of WISC-R 

Between %V>P %P>V +and- Cumulafave 

VIQ and PIQ (+ Difference) (-Difference) Differences Percentage 



30 



and above 


1.13 


1.13 


2.26 


100.00 


26-29 


.75 


0 


0.75 


97.72 


22-25 


2.64 


.75 


3.39 


96.97 


19-21 


2.26 


1.89 


4.15 


93.58 


16-18 


6.04 


2.64 


8.68 


89.43 


13-15 


4.53 


4.15 


8.68 


80.75 


10-12 


6.04 


7.92 


13.96 


72.07 


7-9 


9.43 


5.28 


14.71 


58.11 


4-6 


10.57 


10.57 


21.14 


43.40 


1-3 


9.81 


10.19 


20.00 


22.26 


0 








2.26 



Inspection of these tables suggests striking differences between ethnic groups. To examine 
those differences, the VIQ - PIQ frequency distribution for each ethnic group was compared to a reference 
distribution using a Chi Square test witii 20 degrees of freedom. The reference distidbution chosen was 
tiiat of \he standardization sample for \he Wechsler Adult InteUigence Scale-Revised, sported by 
Matarazzo and Herman (1984), since tiiose autiiors reported direction as well as magnihide of tiie VIQ- 
PIQ difference Hispanics and Asians were not found to differ from \he WAIS-R standardization sample 
African-Americans: x^(20, N = 252) = 196.9, p<.001, Caucasians, Z^(20, N = 4895) = 1382.6, p< 001, and 
FiUpinos, x'(20, N = 182) = 90.7, p<.001, did differ significantly from tiie reference disti-ibution. The 
nature of those distiibutions is shown in Figure 5. 
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Figure 5. Distributions of (VIQ - PIQ) differences as a function of ethnic group. 
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Again we see the trend for African-Amercan and Caucasian children to have higher VIQ than 
PIQ, while the reverse is true for Filipino children. The previous analyses found the trend in group 
mean scores. The Chi Square differences between VIQ and PIQ confirm those findings in individuals 
and further strengthens the evidence for differences in patterns of strengths and weaknesses across 
ethnic background. 

Given these differences in distributions between ethnic groups, it becomes crucial to look at 
population incidence of large VIQ-PIQ discrepancies as a function of ethnic background. Only in this 
way can we determine whether an event that is rare in one group, and is taken as a cliiucal indicator of 
abnormality, also holds for other groups. Within each ethnic group, occurrences of magnitudes of 
VIQ-PIQ discrepancies were counted so that population rarity could be compared with statistical 
significance (as presented in the WISC-R manual) for each group. That is, a VIQ-PIQ difference of 12 
points has been found to be statistically significant at the .05 level. This finding is frequently 
misinterpreted to mean that only about 5% of normal children will have a difference of that magnitude. 
However, Kaufman (1976) pointed out that approximately 30% of normal children with average 
intelligence have discrepancies at least that high, as do 36% of children in the standardization sample 
with IQ scores of at least 120. In Table 9, the difference required for statistical significance is compared 
to that actually observed in each of the ethnic groups. For example, a difference of 12 points Ls needed 
to be sure (within an error probability of .05) that a child's Verbal and Performance abilities are 
significantly different. For the Asian children in this sample, however, a difference of 25 points or more 
is needed in order for the difference to be rare enough to be observed only about 5 percent of the time. 



Table 9. Empirically Different Magnitudes of VlQ-PlQ Discrepancies, as a Function of Ethnic Group 



Magnitude of 
Difference Required 
Statistically* 



Magnitude of Difference 
Empirically Observed at Each 
Level of Probability 



p value 




Caucasian 


Hispanic 


Afr.Amer. 


Asian 


Filipino 


.15 


8 


20 


18 


22 


19 


21 


.10 


10 


23 


19 


25 


22 


23 


.05 


12 


27 


23 


29 


25 


27 


.01 


15 


34 


40 


35 


30 


38 



to be reliably different from 0 



Considerable variation between groups can be seen in Table 9. Although a VIQ-PIQ difference 
of 30 points is rare in the gifted Asian population of our sample (occurring only about once in every one 
hundred children), 5 in every one hundred African-American children are observed to have that 
difference, and even more chUdren in each of the other three groups. One can easily imagine a scenano 
in which, for example, norms are set using a predominantly Asian population, rare (less than 5% of the 
population) VIQ-PIQ differences are defined to be a diagnostic indicator for leammg disabilities, and 
that standard is used for all children. In this particular gifted sample, such a cntenon could lead to 
labelling twice as many Caucasian, FiUpir.o, and African-American children as Asian or Hispanic leammg 
disabled. Tlie scenario is admittedly an exaggerated one and it is to be hoped that m achial prachce 
one single test is never the sole criterion for diagnostic or placement decisions. 
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The effect of level of risk was further examined as base rates among children at no, low, and 
high (2 or more identified areas of risk) risk were compared (see Figure 6). 

Figure 6. E)istributions of (VIQ - PIQ) differences as a function of level of risk. 
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Distributions were statistically compared using Kolmogorov-Smimov 2-sample tests. Children 
with no identified risk differed from those with one risk (Z = 1.526, p < .05), and children with only one 
risk differed from those with more than one risk (Z = 1.951, p < .001). As can be seen in the figure, 
children from high risk backgrounds more frequently tended to have higher PIQ than VIQ. This comes 
as no surprise, in Ught of findings from group means that children at high risk have lower mean VIQ. 
Each of these children achieved a FuU Scale IQ of at least 130. In order to accompUsh that in the face of 
a disadvantaged VIQ, PIQ must be even higher than for those at no risk. Again, we see differences in 
pattern across groups. 

Finally, subsamples of this demonstratedly gifted sample were selected so that rates of VIQ- 
PIQ differences could be compared in gifted high and low achievers. For this purpose, scores on the 
California Test of Basic Skills (CTBS) were obtained. Two subsamples were selected; 96 chUdren whose 
CTBS scores were all at a stanine of 9 were designated "high achievers", and 108 children whose CTBS 
scores were all at a stanine of 6 or below were called "low achievers". Single classification ANOVAs 
revealed that the groups did not significantly differ in PIQ; for high achievers, M = 132.2 (SD=9.7), 
while for low achievers M=130.6 (SD=9.1). Low achievers were, however, significantly different from 
high achievers in VIQ, F(l,203) = 13.49; p < .001. Group means were 137.8 (SD=8.5) and 133.4 (SD=7.8). 
respectively. 

Figure 7. Distribution of VIQ-PIQ differences at the extremes of achievement. 
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VIQ - PIQ distributions for the two groups are shown in Figtire 7. No significant differences 
were found betv/een the two distributions (Kohnogorov-Smimov Z = 1.007, p =.263). This impUes that 
use of large VIQ-PIQ discrepancy as an indicator of risk for low achievement is indeed faUacious, smce 
relatively large VIQ-PIQ discrepancies are as likely to be seen in high achievers as in low achievers. 
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study 11 : Multivariate attempts to use the WISC-R to select an ethnically balanced gifted population. 

An ethnically balanced random sample of 1438 children identified as potentially gifted was 
generated. Mean scores on the eight subtests of the WISC-R routinely administered in the district are 
summarized in Table 10. 



WISOR Scores in a Randomly Balanced Sample of Children Identified Potentially Gifted 

Entire African- 
Score Sample American Asian Caucasian Filipino Hispanic 

127.0 134.8 133.1 128.5 129.8 

(11.9) (10.2) (10.0) (11.4) (11.8) 



FSIQ 130.6* 
(11.3)'^'^ 



VIQ 
PIQ 

Information 

Similarities 

Arithmetic 

Vocabulary 

Picture 

Completion 
Picture 

Arrangement 
Block 

Design 
Object 

Assembly 



129.1 


127.5 


129.8 


132.0 


lz4.1 


1 07 Q 
iZ/.O 


(12.1) 


(12.1) 


(12.4) 


(11.0) 


(12.5) 


(12.5) 


125.6 


120.4 


132.8 


127.1 


127.2 


125.6 


(12.3) 


(12.8) 


(11.6) 


(11.4) 


(12.3) 


(12.3) 


13.5 


12.9 


14.1 


14.1 


12.9 


13.3 


(2.5) 


(2.4) 


(3.1) 


(2.4) 


(2.4) 


(2.6) 


15.8 


15.7 


15.2 


16.2 


14.8 


15.8 


(2.4) 


(2.5) 


(2.5) 


(2.2) 


(2.7) 


(2.4) 


13.9 


13.5 


14.4 


14.3 


13.6 


13.8 


(2.4) 


(2.4) 


(2.3) 


(2.3) 


(2.3) 


(2.4) 


14.7 


14.6 


14.9 


15.2 


13.8 


14.2 


(2.6) 


(2.5) 


(2.6) 


(2.4) 


(2.8) 


(2.7) 


13.2 


12.7 


13.5 


13.3 


13.0 


13.4 


(2.4) 


(2.3) 


(2.2) 


(2.4) 


(2.4) 


(2.4) 


14.0 


13.3 


14.5 


14.2 


14.0 


14.1 


(2.7) 


(2.7) 


(2.7) 


(2.7) 


(3.0) 


(2.7) 


14.0 


12.7 


16.2 


14.4 


14.7 


13.8 


(3.0) 


(3.0) 


(2.8) 


(2.8) 


(2.7) 


(2.9) 


13.3 


12.4 


13.8 


13.6 


13.3 


13.3 


(2.8) 


(2.9) 


(2.8) 


(2.6) 


(2.9) 


(2.7) 



* Mean 

Standard Deviation 



Inspection of Table 10 reveals the problem experienced by any diverse school district in its 
efforts to provide equal access to gifted programs based primarily on Full Scale IQ as measured by the 
WlSC-R. As has happened in San Diego City Schools, Caucasians and Asians wiU be over-represented, 
while Hispanics and African-Americans wHl be under-represented. Assuming that the WISC-R does 
indeed predict academic achievement and that an ethnic balance in gifted programs is a desirable and 
in fact necessary goal, each ethnic subsample was divided on the basis of FSIQ: the upper 70 /o o each 
eroup was designated "gifted" for the purposes of the foUowing analyses, and the lower 30 /o of each 
was designated "nongifted". Those percentages were estimated based on the overaU number of children 
referred for individual testing versus the 70% finally selected for inclusion in gifted enrichment 

classrooms. , , , , , 

Stepwise multiple linear regression analyses were carried out on the scaled scores of the whole 
sample, as well as each ethnic subsample, in order to determine which subtests of the WISC-R best 
predict giftedness for each group. Results are summarized in Table 11. 



Table 11. Stepwise Multiple Linear Regression Models to Predict Giftedness 



Sample 


Subtests m the Model 


IS 


I\ 


i\ 


Entire Sample 


Vocabulary 


.156 






UDject AssemDiy 


172 








Picture Completion 


.153 








Similarities 


1 A7 
.10/ 








Block Design 


1 RH 
.LOV 








Picture Arrangement 


.144 








Arithmetic 


1 








Information 


1 ni 






African-Americans 


Information 


.LOU 








Object Assembly 


.189 








Similarities 


.163 








DiocK L-'esign 


.150 








Picture Arrangement 


.140 








Anthmetic 










Vocabulary 


.114 






Asians 


Information 


.457 






Til 1 T~\^ Z 

Block Design 






Caucasians 


Object Assembly 


.236 








Vocabulary 


. lOl 








Picture Completion 


.183 








Arithmetic 


1SQ 
. L^y 








Block Design 


179 
. 1/^ 








Similarities 


1 c? 
.10/ 








Picture Arrangement 






.567 




Information 


.112 


.753 


Filipinos 


Picture Arrangement 


9Q1 






Picture Completion 


.2/1 








Block Design 


9ftA 




489 




Similanties 


.Z/4 


.077 


Hispanics 


Similarities 


99fi 






Object Assembly 


.168 








Information 


.154 








Picture Completion 


.151 








Picture Arrangement 


.154 








Vocabulary 


.161 


.735 


.533 




Block Design 


.131 



For each group except Asians, ttve best stepwise selection model was able to account for 
approxin-cately 50% of the variance or more: for Asians, was only .36. Variables in the model differed 
across ethnic groups, as weU. For Caucasians, as for the sample as a whole, aU subtests entered mto the 
equation For Hispanics, only Arithmetic failed to enter, whUe for African- Am.ericans the Picture 
Completion subtest did not enter the model. The best-fitting model for Filipinos included Picture 
Arrangement, Picture Completion, Block Design, and SimUarities. Only two subtests were mclude m 
the mwiel for Asians: Information and Block Design. Again we see differences in pattern of strengths 
and weaknesses, reflected in different predictors of giftedness across ethnic background. 
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To determine the efficacy of the best predictive model discriminant analysis was performed 
for the entire sample using all subtests as predictors and gif tedness as the criterion. With two groups 
(gifted and non-gifted), one discriminant factor was generated. Results are summarized in Table 12. 



Table 12. Discriminant Function Coefficients for the Identification of Gif tedness in the Entire 
Ethnically Balanced Sample 



Standardized Coefficients 



Information 


.252 


Similarities 


.376 


Arithmetic 


.111 


Vocabulary 


.265 


Picture Completion 


.270 


Picture Arrangement 


.273 


Block Design 


.225 


Object Assembly 


.298 



Pooled Correlations Between Subtests 
and the Discriminant Function 

Similarities .566 

Vocabulary .555 

Information .549 

Object Assembly .461 

Block Design .437 

Picture Completion .424 

Picture Arrangement .412 

Arithmetic .347 



False positives, false negatives, and hit rates, in percentages, are provided for the whole sample 
and for each ethnic group within that sample in Table 13. 



Table 13. Hit Rates, False Positives, and False Negatives for tlw Best Overall Model 







False 


False 


Group 


Hit Rate 


Positives* 


Negatives** 


Entire Sample 


89.3 


7.5 


19.9 


African-Americans 


81.7 


0 


23.6 


Asians 


93.9 


33.3 


0 


Caucasians 


87.9 


42.9 


.5 


Filipinos 


89.6 


3.6 


12.4 


Hispanics 


95.0 


4.2 


5.3 



* of those who were not gifted, the percent called "gifted" 

** of those who were gifted, the percent called "nongifted" by the model 



The most critical errors are represented in the "false negatives" column of the table. Those 
numbers represent children who have imusuaUy high potential that would not be recognized. Those 
children would be denied a chance to excel in special programs for the inteUectuaUy gifted. When we 
examine false positive and negative rates for subgroups, we see the repetitive pattern of over-selection 
of Caucasians and Asians accompanied by the under-selection of African-Americans. The one group 
for whom this model is an improvement is Hispanics. Thus we demonstrate that no one single model 
using the WISC-R, no matter how sophisticated and complex, will select an ethnically proporHonate 
sample for inclusion into enrichment programs for the gifted. 



The efficacy of individual models of selection, based on ethnic background, was investigated 
by performing discriminant analyses on each ethnic subsample, using all available subtests of the 
WISC-R as predictors and giftedness as the criterion. Results are sununarized in Tables 14 through 18. 

Table 14. African-Americans: Discriminant Function Coefficients for the Identification of Giftedness 

Pooled Correlations Between Subtests 
Standardized Coefficients and the Discriminant Function 



Information .298 

Similarities .287 

Arithmetic .216 

Vocabulary .195 

Picture Completion .132 

Picture Arrangement .254 

Block Design .267 

Object Assembly .310 

Eigenvalue .9512 WUks' Lambda .513 



Information 


.573 


Object Assembly 


.549 


Arithmetic 


.522 


Similarities 


.517 


Block Design 


.512 


Vocabulary 


.500 


Picture Arrangement 


.455 


Picture Completion 


.364 



Table 15. Asians: Discriminant Function Coefficients for the Identification of Giftedness 
Standardized Coefficients 



Pooled Correlations Between Subtests 
and the Discriminant Function 



Information .513 

Similarities .011 

Arithmetic .174 

Vocabulary .247 

Picture Completion .213 

Picture Arrangement .268 

Block Design .556 

Object Assembly -.055 



Information .648 

Vocabulary .579 

Arithmetic .554 

Picture Completion .502 

Block Design .500 

Similarities .379 

Object Assembly .311 

Picture Arrangement .209 



Eigenvalue .746 



Wilks' Lambda .573 



Table 16. Caucasians: Discriminant Function Coefficients for the Identification of Giftedness 



Standardized Coefficients 



Pooled Correlations Between Subtests 
and the Discriminant Function 



Information 
Similarities 
Arithmetic 
Vocabulary 
Picture Completion 
Picture Arrangement 
Block Design 
Object Assembly 



.202 
.289 
.294 
-324 
.337 
.227 
.313 
.417 



Object Assembly .480 

Vocabulary .453 

Information .436 

Block Design .419 

Picture Completion .392 

Similarities .392 

Arithmetic .386 

Picture Arrangement .331 



Eigenvalue 1.310 



Wilks' Lambda .433 
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Table 17. Filipinos: Discriminant Function Coefficients for the Identification of Giftedness 



Standardized Coefficients 



Pooled Correlations Between Subtests 
and the Discriminant Function 



jjiiorin anon 


.003 


Picture Arrangement 


.507 


Similarities 


.321 


Vocabulary 


.486 


Arithmetic 


.172 


Similarities 


.4/ i 


Vocabulary 


.199 


Block Design 


.468 


Picture Completion 


.419 


Object Assembly 


.427 


Picture Arrangement 


.435 


Picture Completion 


.424 


Block Design 


,413 


Information 


.376 


Object Assembly 


.230 


Arithmetic 


.359 


Eigenvalue 1.076 


Wilks' Lambda 


.482 





Table 18. Hispanics: Discriminant Function Coefficients for the Identification of Giftedness 
Standardized Coefficients 



Pooled Correlations Between Subtests 
and the Discriminant Function 



Information 


.252 


Similarities 


Similarities 


.376 


Vocabulary 


Arithmetic 


.111 


Information 


Vocabulary 


.265 


Object Assembly 


Picture Completion 


.270 


Block Design 


Picture Arr'u'igement 


.273 


Picture Completion 


Block PL-sign 


.225 


Picture Arrangement 


ObjevC Assembly 


.298 


Arithmetic 



.566 
.555 
.549 
.461 
.437 
.424 
.412 
.347 



Eigenvalue 1.191 



Wilks' Lambda .456 



Hit rates, false positive and false negative rates for the use of these individual functions are 
summarized in Table 19. 



Table 19. Hit Rates, False Positives, and False Negatives for the Best Individual Discnminant 
Function Models 







False 


False 


Model 


Hit Rate 


Positives 


Negatives 


African- American 


93.6 


17.5 


3.1 


Asian 


90.9 


16.7 


7.4 


Caucasian 


95.0 


12.2 


2.3 


Filipino 


93.6 


7.1 


6.2 


Hispanic 


93.8 


7.6 


5.6 



* of those who were not gifted, the percent called "gifted" 
of those who were gifted, the percent called "nongif ted" by the model 



By using individual models and capitalizing on differences in pattern of strengths and 
weaknesses across ethiuc groups, rates have been improved for some groups. Caucasians and Asians 
are still overrepresented, as now are African-Americans. Hispanics and Filipinos have nearly equal 
fcilse positive and false negative rates. An important note is that the models used to obtain these rates 
are based on functions that are weighted sums of subtests, and not simple combinations of subtests 
providing easy cut-off scores. These are the best rates available, based on fairly complex linear 
combinations. Any combination of subtest cutoff scores used in actual practice would necessarily have 
lower success rates. 

To investigate the possibility that discrimination of nongifted from gifted could be improved 
by grouping those with similar patterns of abilities, African- Americans and Cauceisians were considered 
together in one discriminant model. Both groups had a tendency for higher VIQ than PIQ (see baserate 
study, above). However, Caucasians are traditionally overselected and African- Americans 
underselected. Results of the analysis are presented in Table 20. 

Table 20. Discriminant Function Coefficients for Groups whose VIQ Exceeds PIQ 
(African-Americans and Caucasians) 



Pooled Correlations Between Subtests 
Standardized Coefficients and the Discriminant Function 



Information 


.150 


Object Assembly 


.542 


Similcuities 


.290 


Vocabulary 


.526 


Arithmetic 


.257 


Information 


.509 


Vocabulary 


.304 


Similarities 


.481 


Picture Completion 


.273 


Block Design 


.473 


Picture Arrangement 


.219 


Arithmetic 


.473 


Block Design 


.224 


Picture Completion 


.433 


Object Assembly 


.349 


Picture Arrangement 


.407 



Eigenvalue .8867 Wilks' Lambda .530 



Using this discriminant model, overall hit rates have gone down to 80.6% for African- Americans 
and 90.1 for Caucasians. The misses, as expected, favor Caucasians (34.7% false positives, .5% false 
negatives) and agao disadvantage African- Americans (0 false positives, 19.4% false negatives). No 
manipulation will improve the rates from the best individual ethnic group discriminant models, obtained 
using all available subtests. 

One might be tempted to argue that identification could be improved and gifted programs 
could be ethnically balanced more economically by choosing each group's strongest subtest cind basing 
the decision on a cutoff score applied to a diffei^t subtest for each group. For example, as was seen in 
Table 10, African-Americans' highest mean scaled score was Information, Asians' was Block Design, 
and so on. Only 24.5% of African- Americans scored below 12 on Information and only 25% of Asians 
scored below 15 on Block Design. Therefore the same cutoff score would not work in both groups on 
the individually selected subtests. 

In similar fashion, it might be proposed that there exists one subtest that, at a given cutoff 
score, would select a balanced population. Not only does that prove not to be the case, but a more 
fundamental issue is involved in this and in the proposal to use a different subtest for each group. The 
basic argument for using the WISC-R as a selection tool for intellectually gifted enrichment programs 
is that it in some way measures a broad array of abilities associated with achievement in school. By 
narrowing the test down, even to four subtests (much less one or two), the predictive power of the test 
is greatly diminished. 



Thus we see that no single selection model using the WISC-R will result in an ethnically 
balanced sample of gifted children from this population. In fact, the most accurate and most 
complicated individual models for ethnic subgroups are not uniformly accurate either. There appears 
to be no way to use the WISC-R to derive cut-off inclusion/exclusion scores in this ethnically diverse 
sample for use in selecting balanced populations for gifted programs in the schools. 



rV. Discussion 

Intellectually gifted children show differences in the pattern of their strengths and v/eaknesses 
on the WISC-R, across ethnic background and across levels of risk in the environment. In the first 
phase of this work, an analysis of base rates of VIQ-PIQ differences in 5796 children with FSIQ scores 
at least two standard deviations above the mean (FSIQ > 130) revealed that African-Americans and 
Caucasians tended to have a higher VIQ than PIQ, whereas in Filipino children the tendency was the 
reverse These trends, evident in group data, were confirmed in frequency distributions of mdividual 
difference scores. The distributions of VIQ-PIQ difference scores of Asians and Hispanics most closely 
resembled those obtained by Matarrazzo (1984) from the standardization sample for the WAIS-R, and 
most closely approximated normal distributions. 

Groups divided on the basis of level of risk from factors such as significant health problems, 
economic hardship, emotional deprivation, or cultural and linguistic factors were also found to differ 
in pattern of strengths and weaknesses. Such hardships proved to be consistently associated with 
lower VIQ relative lo PIQ in the individual. It can be surmised that, of all children at risk from two or 
more of these factors, this sample contained only the most invulnerable children— only the children 
stiU able to achieve a Full Scale score two standard deviaUons above the mean— and that in a randomly 
selected population across IQ ranges, the differences would be more extreme. Comparison of the 
relatively low proportion of high-risk chUdren seen in the gifted base rate sample, as opposed to the 
entire sample of children referred for giftedness assessment, appears to corroborate that hypothesis 
(refer to Figures 1 and 2). 

The myth that relatively large VIQ-PIQ discrepancies are somehow a diagnostic indicator 
for learning difficulties, such as low achievement relative to potential, was debunked in this sample. 

Groups of gifted chUdren at the extremes of achievement (aU achievement scores in the ninth stanme 
versus aU achievement scores in the sixth stanine or lower) were compared and found to have equivalent 
ranges of VIQ-PIQ difference scores. 

In the second phase of the work, an ethnically proportionately balanced sample of 1438 
potentially gifted chUdren was randomly selected. From that sample, selection models were denved 
and examined for goodness of fit in an effort to find a way to use the WISC-R to select a balanced 
population for educational enrichment programs. Alternative hypotheses that 1) there exists a smgle 
pattern of subtest scores that predicts giftedness equaUy across ethnic background, or 2) there is a 
unique pattern of cognitive strengths and thus different predictors of giftedness across groups were 
investigated. The best single model obtained by discriminant analysis appeared accurate overall. When 
examined in terms of individual ethnic groups, however, it proved to be biased in favor of Caucasians 
and Asians, and biased against African-Americans and FiHpinos. As was seen in the base rate shidy, 
different patterns of strengths were evident across groups. Even accounting for those differences with 
individual best-fitting models, efforts to improve selection balance failed. The very best mdividual 
models overselected African-American, Asian, and Caucasian children. No way was found to use the 
WISC-R to select a proportionately balanced population. If individual subtests or combinations of 
two or more subtests are used, as is suggested by some authors (Wrks, Vessels Quarf°rth, & Q^^no^- 
1980; Elman, Blixt, & Sawicki, 1981; Karnes & Brown, 1981; Kaufman 1979; Killian & Hughes, 1978, 
Sattler, 1988), discriminabiHty suffers. Perhaps more importantly predictive power of the WISC-R is 
decreased. 

The present results confirm the findings of differences in pattern of WISC-R perfoimances 
between etimic groups reported by Saccuzzo et al. (1992). Moreover, for the first time, base rates for 
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VIQ-PIQ difference scores are presented for a large, ethnically diverse sample of gifted children. The 
distributions of VIQ-PIQ difference scores were found to be substantively different in shape as well as 
in direction across ethnic groups, in contrast to the report that discrepancy scores "did not vary too 
greatly with ... race" at any level of IC. in the standardization sample of the WISC-R (Kaufman, 1976). 
In fact, statistical power was too lov for identification of differences had they existed in the higher end 
of IQ scores in the standardization sample. Also for the first time, high levels of risk in a child's social 
and home environment have been shown in the present work to be associated with lower VIQ relative 
to PIQ in children in the upper end of the IQ distribution. Moreover, the influence of risk factors 
appears to confer a disadvantage over and above any effect of ethnic background. 

No model was found to enable the WISC-R to be used to select equal proportions of gifted 
children from a variety of ethnic backgrounds. Therefore, in an ethnicaUy diverse population, it would 
seem that Sattler (1988) is correct in saying that children who are culturally different are difficult to 
identify. The results of commonly used identification practices can be seen nationwide in the over- 
representation of Asians and Caucasians, as well as the under-representation of African-Americans 
and Hispanics. The results of this study strongly suggest that use of the WISOR in diverse 
populations as the primary selection device for gifted programs is an inappropriate use of the test, 
if one of the goals of such use is to select proportionately representative numbers from each group. 

Uniformity assumption myths abound in psychological assessment. Sattler (1988) presented 
cogent arguments in favor of national norms and against the idea of pluralistic norms. He pointed out, 
in part, that the WISC-R was standardized on a carefully stratified sample with ethnic minorities 
represented in proportion to their representation in tlie population. That is certainly true on a national 
level, but the appropriateness of using national norms so derived to define cut-off scores in a population 
which is predominantly non-Caucasian, as is this school district, is questionable. Normative scores 
are derived based on factors known to affect test performance: the WISC-R manual provides only age- 
corrected norms. The current study has demonstrated that differences in test performance on the 
WISC-R exist across ethnic background and across gender. In the past, the major psychological and 
educational assessment devices have been standardized primarily in terms of age or education, and 
sometimes gender. More recently, authors have stressed the importance of differences and the need for 
sets of norms that consider multiple factors such as gender, age, and education concurrently (Heaton et 
al, 1986), especially when these scores are used for clinical decision-making. Use of the WISC-R for 
selection of individual children for enrichment programs is, in essence, a clirucal decision-making 
process. The idea of pluralistic norms based on ethnic background, however, is poUticaUy an extremely 
sensitive issue. Perhaps the clearest conclusion from the findings of the present work is that the test, 
with existing norms, produces scores that are certainly more appropriate for some groups of children 
them for others. 

If "intelligence" or "intellectual giftedness" were to be defined as exactly those abilities 
underlying the quahty of an individual's performance on the WISC-R, then we would have to conclude 
that the WISC-R is the best instrument to use for selection, regardless of any socio-political considerations 
such as the need for ethnic, socioeconomic, or even gender balance. However, we are dealing with a 
theoretical construct (intelligence) imperfectly measured by the WISC-R within a known error of 
measurement. This work has demonstrated that groups divided either on ethnic background or on 
environmental factors differ in the pattern of their performances on the WISC-R. The results do not 
and can not address the issue of how much of the individual and group differences are a result of 
biologic (presumably neural) differences or of environmental influences. Aside from the social and 
poUtical implications of the use of the WISC-R as an entry criterion in diverse populations, the results 
of the present work indicate that the basic assumption of uniformity of pattern of performance across 
groups on the WISC-R is flawed. For whatever reason, be it biologic, environmental, or a combmation 
of the two, pattern of performance across groups is not uniform. 

Instead of attempting to find a way to continue to use the WISC-R in gifted selection models, 
it may behoove educators to adopt the use of multiple test instruments, including a nonverbal instrument 
such as Raven's Progressive Matrices (Raven, 1938) plus a measure of verbal reasoning abiHty as weU 
as behavioral and motivational indicators. In any case, inclusion/ exclusion decisions should never be 



based on a single test score, just as no clinical decision should be solely based on any one score or even 
on one test. 



The studies reported here are limited by the same factors inherent in any quasi-experimental 
design as well as by the limitations of archival and cross-sectional research. Attempts weiC made to 
control for gender and to examine ethnic and risk level effects. It should be pointed out, however, that 
the original sample from which subsamples were drawn was not a random, multivariate normal sample 
from the entire population. Instead, tliis was a sample of children referred by parents, teachers, and 
central nomination for assessment because each had in some way demonstrated the potential for 
intellectual giftedness. Almost certairdy biases were Inherent in that referral process. One of those 
biases can be seen in the unequal proportions of children from different ethnic backgrounds. Another 
concerns the under-representation of WISC-R scores from Hispanic chUdren v/ho have English as a 
second language, and are often tested with other assessment devices. 

Self-report questionnaires are a source of error from both under-reporting and over-reporting. 
Ethnic background was deduced by response of the primary caregiver to a school district questionnaire, 
and incidence of risk was gathered from information questionnaires provided by teachers and by parents. 
For example, a child who Ls 10% Native American may be reported as Native American, while one who 
is Hispanic /Caucasian may be reported as Caucasian because of beliefs the parents hold about the 
impUcations for their child of certain ethnic designations within the education system. The risk factors 
examined in this work are almost certainly an under-representation of population incidence. Those at 
risk may have been under-reported both by teachers who have have less contact with particular groups 
of parents, and by the parents who are overwhelmed by the same environmental stressors that affect 
their children. It is likely that the more seriously economically and environmentally disadvantaged 
have less access to health care and may also mistrust an educational establishment that doesn't seem to 
be addressing their most pressing needs. On the other hand, affluent parents may over-report certain 
risk factors, such as health problems and emotional problems. 

There could be other areas of risk not included in the risk factor questionnaires used by this 
district. For example, acculturation issues are complex and are not v/ell investigated in these 
questionnaires. Other than the self-report of cultural differences in the home (in the student-parent 
questionnaire. Appendix C), no attempt could be made to divide groups on the basis of acculhiration, 
since we did not have access to detailed structured interviews. Lastly, ethnic categories were broad and 
included diverse groups within some single categories. For example, the one category "Hispanic" 
included Latinos, Cubans, Puerto Ricans, and Hispanics. The group "Asians" included chUdren of 
Japanese and Chinese background. It may be that more differences exist within these heterogeneous 
groups than across our ethnic categories. 

In terms of the test data itself, some subtest scores were frequently missing from the data since 
the school district, due to time and financial constraints, does not routinely administer all of the subtests 
of the WISC-R. Therefore Coding, Mazes, Comprehension, and Digit Span could not be included in the 
multivariate modeling phase of the study. It may be possible to find more accurate selection models 
with the WISC-R if those subtests are included. 

FinaUy, the sample was drawn entirely from the San Diego area. Results may not generalize to 
other geographic areas: San Diego is a metropoUtan area with a preponderance of Latinos in its Hispanic 
population and a significant proportion of first and second generation Asians in its Asian population. 
As noted above, the majority of the students in the district are non-Caucasian. 

On the other hand, the sample reported here is derived from an ethnically diverse district that 
has consistently shown a commitment to identification of disadvantaged students for gifted programs. 
The sample does consist of the entire population of children referred and subsequentiy administered 
the WISC-R as part of the selection process for gifted education programs. The number of non-Caucasian 
children, particularly in the gifted base rate sample, is larger and more diverse than any previously 
reported. Moreover, a completely balanced sample of 1438 was randomly selected from an overaU 



sample of 8396. Lastly, if risk factors were mdeed under-reported and ethnic groups were heterogeneous, 
the likelihood of finding clear differences between groups would be decreased. Nevertheless, differences 
were found. 

This work has demonstrated clear differences in pattern of abilities across ethnic backgrounds 
and across levels of risk in the children's environment. Population rates of VIQ-PIQ discrepancies 
have been documented, and the importance of the difference between statistical significance and clinical 
rarity across ethnic groups has been illustrated. No single model using the WISC-R was found to 
provide proportionately equal access to gifted programs. Individual models based on ethnic background 
failed to achieve ethnic balance, since individual models over-selected African- Americans, Asians, and 
Caucasians relative to FiUpinos and Hispanics. Therefore, use of the WISC-R in a diverse population 
to select a balanced group was demonstrated to be inappropriate. 

Other instruments, such as Raven's Progressive Matrices, need to be tested in such a large, 
multicultural population, as has been repeatedly recommended (Baska, 1986; Pearce, 1983; Valencia, 
1984). Some combination of assessment devices that account for motivation as weU as mteUectual 
potential may need to be evaluated. Given that we find a way to identify greater proportions of 
disadvantaged children with high potential, the focus then must turn to finding ways to ensure that 
these "different" children express that potential. The children we identify with alternative methods 
may not be the verbally gifted, behavioraUy compliant children who currently populate the gifted 
classrooms in this distiict. Further work will need to focus on changes in the enrichment programs 
themselves, to enable teachers of the gifted to unlock and direct the potential these children demonsti-ate. 
Improved identification is certainly a goal that needs to be met, but it wQl be an empty victory if it is 
achieved and the children so identified faU to be able to express that potential in ways tiiat add to their 
own growth as well as the growth of their cultures and societies. 

There is a dearth of data on how best to nurture particular kinds of talents: that lack of research- 
based knowledge, combined with administrative inflexibiHty in the use of resources (particularly in a 
climate of increasing budget constraints), bodes poorly for chUdren whose giftedness is expressed not 
so much in verbal domains as in other intellectual areas. We have seen from the experience at Berkeley 
and other universities (D'Souza, 1991) the disturbing outcome of including individuals who would 
not have qualified based on established, braditional, uniform criteria into a system which does not 
change to fit their needs. Dropout rates are high, and we do not know the long-term negative effects of 
the experience for those who are not able to complete the program. Future shidies are needed to 
examine ways to develop effective educational programs for diverse classes of very young gifted 
children, and longihidinal shidies wiU be necessary to evaluate the effectiveness of the interventions 
so developed. Understanding of motivational principles, group process, and cultiaral as well as 
individual differences in achievement needs and in pattern of abilities is vital for the development of 
interventions to provide mastery experiences for these children early in the educational process. Only 
then can we provide more effective sti-ategies for engaging these children in the life-long growth and 
development of the potential they as individuals possess. 
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Appendix A. 



San Diego City Schools 
Educational Services Division 
Gifted and Talented Education 

Tp^( ;lipr Nominati on Form 



Date_ 



Student ^ • ^ 

j^^j^g^ Date Sex Ethnic Code_ 

(last) (first) (mi) 
S^qqI^ ^Grade Room Number 



rj TTDR FOR IDEN1TFTCATION OF PROSPECTTVF, GATE CHILDREN 

Please rate (name) o^i each of the following characteristics. This is a 

five^point scale with the lower end of the scale (#1) indicating lower than average performance and 
the upper end (#5) indicating excellent or exemplary performance. 



I. PERSONAL 

1. Curious; asks many questions 

2. Self-motivated; requires little 
external direction or encouragement 

3. Likes to organize people and structure activities 

4. Generates many ideas, questions, and suggestions 

5. Flexible; adapts readily to new situations 

6. Impatient with routine tasks 



n. FXPRESSION 

7. Vocabulary beyond chronological age or grade level 

8. Advanced skill in written expression 

9. Proficiency in oral expression 



III. THOUGHT PROCESSES 

10. Quick and accurate recall of factual information 

11. A storehouse of information on a variety of topics 

12. Readily recalls visual information 

13. Readily recalls auditory information 

14. Generalizes leanung from one experience to another 

15. Finds differences and similarities in events 

16. Understands concepts without extensive concrete 
examples 

17. Can establish relationships between seemingly 
unrelated concepts and ideas 

18. Is insightful about cause and effect relationships 
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IV, ppn pT TrnoN and ou ti^ut 

19. Displays a great deal of imagination 

20. Manipulates ideas (i.e., makes changes and 
elaborates upon them) 

21. Concerned with improving or adapting objects ai\d systems 

22. Capable of intense concentration on tasks of interest 
to her/him 

23. Does not give up easily when confronted with a challenge; 
shows determination in achieving goals 

24. Offers unique, clever responses to questions 

25. Resourceful, knows where to find answers 

V. ACHIEVEMENT 

26. High performance (grades) in a particular subject, 
e.g., math, language arts, science, other 

27. Achieves at a high educational level 



28. Has strong communication skiUs; gets ideas across 

effectively — 

29. Assumes leadership role easily 

30. Facilitates and directs efforts 

Vn. OTHER CHAR ACTERISTICS 

31. Dominates situations 

32. Expressive of thoughts and opinions 

33. Compulsive about work and work habits; strives for 

perfection ~ 

34. Becomes involved m task, loses awareness of time - 

35. Persistent in pursuing discussion beyond cutoff point 

36. Appears inattentive, withdrawn (daydreams) 

Prepared by Recommended? Yes No_ 

(Teacher) 

Recormnended? Yes ^No_ 



Reviewed by_ 



(Administrator/Designee) 



Appendix B. 



San Diego City Schools 
Educational Services Division 
Gifted and Talented Education 



TEACHER NOMINATION FORM 

Date_ 



Name_ Birth Date Sex Ethnic Code_ 

School Grade ^Track Room Number.. 



SOCIAL /ENVIRONMENTAL VARIABLES 

Please check all items that apply: 
L ENVIRONMENTAL 

Latks preschool/kindergarten experience 

Irregular attendance 

Transiency (3 or more school moves) 

Limited home enrichment opportunities (availability of books, periodicals, 

family interaction, family outings) 
Home conflicts: 

Responsibilities and study time 

Excessive child care responsibility 

Working to help support family 

Overcrowding — no study area 

Inconsistencies in the home 



2. ECONOMIC 

Economic hardship 

Single parent head of household 

Unemployment 

3. LANGUAGE 

Primary language of parent and /or student is other than EngUsh 

Not proficient/ fluent in English 

Uses non-standard English 

Student eru-olled in Second Language Immersion Magnet (SLIM) 

4. CULTURAL 

Limited home /school commimication 

Experience in dominant culture is limited 

Cultural values and beliefs differ from dominant culture 
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SOCIAL/ENVIROI>JMENTAL VARIABLES 
Page 2 



5. <;nrTAI7EMOTI ONAL 

Cliild abuse: physical mental neglect. 

Emotional /adjustment problems 

Working with district counselor 

Working with social worker 

Utilizing psychological services 

Other: • 

Significant home factors 

Separation 

Divorce 

Death 

Extended absence of parent 

Military 

Employment 

Other: ^ 

Family 

Single parent 

Remarriage /step-parent 

6, HEALTH 

Designated instructional services 

PHDIS 

Speech and language 

Vision 

Hearing 

Adaptive RE. 

Severe allergies 
Asthma 

Frequent medical/health referral 

Regularly prescribed medication 

Other: . 



Prepared by.^ Recommended? Yes No. 

(Teacher) 

Reviewed by___ Recommended? Yes No. 



so 
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Student Name: 



(Last) 
Address^ 



(City) 
Grade 



San Diego City Schools 
School Services Division 
Gifted and Talented Education 

STUDENT/PARENT INFORMATION FORM 

Date. 



Birth Date 



Sex School 



(First) (mi) 



Mother's name. 



(Street) 



Father's name„ 



(State) (Zip) 
Room Number 



_Occupation_ 
Work Phone_ 

_Occupation_ 
Work Phone. 



Track. 



Schools Attended 



Grade 



Home Phone. 
Dates Attended 



1. Names and ages of brothers and sisters:. 



2. Describe your child's attitude toward school:. 



3. List any special interests, talents, and skills your child may have:. 



4. What special lessons, traiiung or learning opportunities has your child had outside of school? 



J. To help us know more about your child, please check any of the following that apply: 



□ allergies □ frequent parent absence 

□ asthma □ parents separated 

□ frequent absences □ single parent 

□ prescribed medications □ remarriage/step-parent 

□ parent in roilitary □ recent death /significant 

illness in family 



□ 3 or more schools attended 

□ no kindergarten or pre- 

school experience 

□ additional language(s) 
spoken in home 



List:. 



6. Has your chUd been previously assessed? □ yes □ no If yes, when? 

7. What other things would you Like us to know that would assist us in assessing your child? 



Name of person 

completing this fom\ 



Relationship 
to student 



CHAPTERS 

Evaluation of Risk Factors in Selecting Children for Gifted Programs 



Part 1: Gifted Children at Risk: Evidence of an Association 
between Low Test Scores and Risk Factors 



Nancy E. Johrison, Dennis P. Saccuzzo, & Tracey L. Guertin 
San Diego State University 



Part 2: Intelligence, Aptitude, and Achievement in Gifted Children 
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Abstract 

Intellectually gifted children from diverse ethnic and cultural backgrounds as 
well as varying levels of risk were evaluated to determine the effect of risk on 
gifted children when intelligence level has been controlled. Each of the 7,323 
children from six ethnic backgrounds had achieved a standardized intelligence 
test score (Wechsler Intelligences Scale for Children - Revised or Raven's Standard 
Progressive Matrices) at least two standard deviations above the mean. Six areas 
of risk evaluated in a case study approach included cultural, economic, emotional, 
environmental, health and language factors. Although each child in the sample 
had demonstrated high intellectual potential, differences were found between 
groups defined on level of risk: no risk, low risk (one and only one area of risk), 
and high risk (more than one area of risk). High risk gifted children were found 
to be disadvantaged relative to those at low or no risk in all measures of both 
aptitude and achievement, as assessed with the Developing Cognitive Abilities 
Test and the Comprehensive Test of Basic Skills. Furthermore, those at high risk 
demonstrated lower WISC-R Verbal IQ scores than children at lower levels of 
risk. Implications and limitations of these findings for assessment of giftedness, 
identification of potential gifted underachievers, and development of gifted 
curriculum are discussed. 
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Gifted Children at Risk: Evidence of an Association between Low Test Scores and Risk Factors 



Since the 1960's, much has been written about gifted children at risk. One focus of this literature 
is the failure to identify gifted children at risk, presumably because of the use of standardized tests as 
criterion measures. Criticism has repeatedly been leveled at the use of standardized tests in any cliild 
considered to be at risk because of ethnic or cultural background (Cronbach, 1984; Sullivan, 1973; Sattler, 
1982), socioeconomic disadvantage (Fetterman, 1986; Harty, Adkins, & Sherwood, 1984; Shaw, 1986), 
health or handicapping condition (Pledgie, 1982), or differences in language (Sattler, 1982). Other 
investigators have noted the difficulty in the identification of giftedness in children with these risk 
factors (Albrecht & Rost, 1983; Bruch, 1971; Harty, et al., 1984). However, less attention has been given 
to children who, despite risk factors, perform at high levels on standardized tests of intellectual potential 
and so are recognized gifted. Passow (1989) suggested that priority areas of research in the education 
of high-ability children should include not only the need to identify and nurture giftedness in 
''disadvantaged" populations, but also explication of what is needed to transform potential into 
performance in the gifted. We hypothesize that intellectually gifted children at risk show patterns of 
disadvantage on tests of potential and achievement when compared to intellectually gifted children 
not at risk, and submit that these patterns may prove useful in the development of educational programs 
designed to unlock potential in gifted children at risk. 

Several factors have been shown to be associated with underachievement in children. Included 
in emotional factors are such stresses as parental conflict due to the absence of a parent, or child 
maltreatment. Zilli (1971) found that underachievers tend to come from broken homes. Wallerstein 
(1985) found that children from single parent homes have a higher rate of absenteeism as well as lower 
academic competence at school entry, and suggested that they may be burdened by too much 
responsibility for care of themselves and of younger siblings. However, the effects of emotional factors 
may be confounded with the effects of socioeconomic status, particulariy in single-parent homes. 

Low socioeconomic stahis has consistentlv been found to correlate negatively with identification 
for gifted programs (Fetterman, 1986; Harty, et al, 1984; Shaklee, 1992; Shaw, 1986). A 1983 (Albrecht & 
Rost) survey of San Diego gifted and talented classrooms using zip code as an indicator of socioeconomic 
status revealed that children from neighborhoods with higher housing prices were more likely to be 
identified gifted and to participate in gifted and talented programs. Pirozzo (1982) pointed out that 
socioeconomic s tahis is a major difference between potentiaUy gifted underachievers and gifted children 
whose achievement matches their potential. One weakness of many studies that attempt to look at 
socioeconomic status is that socioeconomic status is often confounded with ethnic or cultural 
background, and with poor health. 

Nichols and Anderson (1973) attempted to control for differences in socioeconomic status 
between African-American and White 7-year-old children to yield a clearer indication of ethnic 
differences between these two groups by matchii^g on SES, geographic location, and provision of prenatal 
care. These authors found that differences in IQ were reduced to approximately five points, in favor of 
White children, rather than the 15-point advantage often cited. In contrast to the approach of most 
investigators to examine deficits, Bruch (1971) reported the inteUectual strengths found in intelUgence 
test performance in gifted African-American children in the southeast. These strengths included practical 
problem-solving, memory operations, and visual and auditory figural content. 

It has been v^ell documented that low socioeconomic status affects the health of children and 
their motiiers. African- Americans, in particular, have a higher rate of premature birth and abnormalities 
in pregnancy tiian do Whites (Wiener & Milton, 1970). Because of multiple risk factors in ttieir 
environments, migrant children and homeless children are characteristic of those described by Haney 
(1963) as, "The most educationally deprived group of children in the Nation" (p. 101). Given the 
inconsistencies in their educational experience as well as the demands placed on them tr care for their 
siblings, it seems unlikely that the potential of such gifted children could be fuUy expressed (Frasier, 
1979 & 1982). As Laslow & Nelson (1974) pointed out, cultural differences, emotional problems, and 
physical disabilities are more likely to occur in children from lower socioeconomic households. 
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The present study attempted to examine a range of identified risk factors as well as level of risk 
for children who attained standardized intelligence test scores that were at least two standard deviations 
above the mean. Therefore, despite varying levels of risk, each of these children had demonstrated high 
academic potential and had been identified inteUechaaUy gifted. The sample was selected from a large, 
diverse sample of children who had been referred and evaluated for giftedness. Six areas of risk were 
examined: culhiral, economic, emotional, environmental, health, and language. Culhiral risk included 
cultural values and beUefs that differ from those of the dominant culture, or limited experience in the 
dominant culhire. Economic risk included parental unemployment or low household income. Emotional 
risk encompassed such factors as death of a parent or sibling, child abuse, psychiatric illness in the 
nuclear family, or extended parental absences due to military service. Environmental risk included 
transiency (three or more school moves) and excessive absences from school that were due to specific 
factors not related to another risk factor (e.g., absences due to home responsibilities such as child care or 
working to help support the family). Health risk included vision, speech, or hearing deficits requiring 
designated instructional service, motor problems requiring adaptive physical education, or chronic 
diseases such as asthma. Language risk included English as a second language or lack of fluency in 
English. 

For the purposes of this shidy, three levels of risk were defined. ChUdren at no risk were those 
who had no identified risk factors. Those at low risk were children who had identified risk in one of the 
six areas previously described. High risk children were those who had been identified as having risk in 
at least two of the areas defined. For the purposes of this shidy, level of risk was defined by number of 
areas of risk, rather than by absolute numbers of individual risk variables within the areas. For example, 
a chUd who had astb.ma as weU as a hearing deficit would, by definition, be at low risk since both risk 
factors faU in the one area of health risk. 

Since these risk areas have been shown to be associated with lower achievement in children, we 
hypothesized that, even among children identified inteUectually gifted based on inteUigence test scores, 
differences would be found in aptihide and achievement test scores as a function of level of risk in the 
child's background. 



Method 



Subjects , _ , 1 . • 

The sample was drawn from the 11,074 cliildren who had been referred for giftedness testmg m 
the San Diego City School System between 1984 and 1991. Of these referred children, 1,107 were Latino/ 
Hispanic; 7,751 were White; 855 were African-American; 393 were Asian; 390 were Indochinese; and 578 
were Filipino. Fifty-one percent were male. The majority of the cWldren were in the second (4^76), 
third (2,473), or fourth (U48) grade. The remainder were in the first (187), fifth (993), sixth (962), seventh 
(392), eight (276), or ninth (192) grade. Information on grade level was not available for 75 children. 

For the purposes of this shidy, a sample of aU children who achieved a WISC-R FuU Scale IQ 
score of at least 130 or a Raven IQ equivalent of 130 was selected. Thus, we isolated the group of high IQ 
scoring children A total of 7,323 cMldren met this criterion. Of these, 464 were Latmo/Hispamc, 5750 
White, 366 African-American, 279 Asian, 303 FiUpino, and 161 Indochinese. Fifty-four percent were 
male. 



Procedure 



Each subject was referred for giftedness testing by a parent or teacher, or through central 
nomination of high-achieving children. Each was then either individuaUy tested wi^th the Wechsler 
InteUigence Scale for Children - Revised (WISC-R) or group tested with the RavenStandard Progressive 
Matrices Test, and group tested with the Developing Cognitive Abilities Test (DCAT). The DCAT an 
aptihide test designed to predict academic achievement, provides estimates of potential for verbal, 
quantitative, and spatial abiUties. ReUability coefficients for the composite score are m the low .90 s, 



while those for the verbal, quantitative, and spatial scores range from the mid .70's to the mid .80's. The 
WISC-R or Raven and DCAT were used in conjunction with the child's scores on the Comprehensive 
Test of Basic Skills (CTBS), a standardized achievement test, A school psychologist then conducted a 
case study analysis of each child and determined, through a self-report questionnaire sent to the home 
and to the child's teacher, whether the child had any one of six risk factors: language, cultural, economic, 
emotional, health, and environmental. In the high IQ sample, 4,303 children had no identified risk 
factors; 1,340 had one; and 1,680 had more than one. The distribution of level of risk varied across ethnic 
backgroTond. Figure 1 illustrates the percentage of children at risk (one risk factor versus more than one) 
for each ethiuc group. The majority of white children had no identified risk factors, whereas the majority 
of Latino, African-American, Asian, Filipino, and Indochinese children had at least one. Seventy-five 
percent of Indochinese children had two or more risk factors, placing them at high risk for failure to 
achieve their potential 



Figure 1. Percentage at each positive level of risk (one risk factor versus two or more) for each ethnic 
backgrovind. 
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Results 



Table 1 shows the correlation among the various risk factors. Oue to the large number of subjects, 
most correlations are statistically significant. Notably high correlations were found between Cultural 
and Language risk, Economic and Environmental risk, Economic and Emotional risk, and Emotional 
and Environmental risk. 



Table 1 

Risk factor correlations (Pearson r) 



Cultiiral 


Economic 


Emotional 


Environmental 


Health 


Language 


Cultural 1.00 


.21" 


.05" 


.18" 


-.02* 


.61" 


Economic 


1.00 


.40" 


.30" 


.08" 


.15" 


Emotional 




1.00 


.30" 


.16" 


.05" 


Environmental 






1.00 


.09" 


.15" 


Health 








1.00 


-.01 


Language 










1.00 



* p < .05 
'*p< .01 



For all subsequent analyses. Level of Risk has been defined as No (no risk factors). Low (one 
risk factor, and High (more than one risk factor). 

For those children who had been given the WISC-R, data were analyzed in a 3 (Risk Level: No, 
Low, High) X 2 (IQ Score: Verbal, Performance) mixed repeated measures ANOVA with repeated 
measures on IQ score. Significant main effects were found for Risk Level, F(2, 5881) = 3-21/ p < .05, and 
for IQ Score F(i^ 5881) = 292.65, p < .001. There was also a significant Risk Level by IQ Score interaction 
f (2, 5881) = 11-21, p < .001. Given the number of subjects, observed power at the .001 level ranged from 
.992 to 1.000. Table 2 shows the means and standard deviations for the WISC-R data. 



Table 2 

WISC-R IQ scores: descriptive statistics for three levels of risk. 



No Risk Low Risk High Risk Marginals 

(n=3964) (n = 974) (n = 946) (d = 5884) 
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M 


SD 


M 


SD 


M 


SD 


M 


SD 


VIQ 


136.15 


(8.63) 


136.78 


(8.87) 


134.83 


(9.10) 


136.04 


(8.76) 


PIQ 


132.55 


(9.05) 


131.95 


(9.21) 


132.75 


(9.34) 


132.48 


(9.12) 


Marginals 


134.15 


(8.84) 


134.65 


(9.04) 


133.79 


(9.22) 







The significant interaction revealed that high risk children were particularly impaired in Verbal 
IQ. Post-hoc Newman-Keuls comparisons revealed that high risk children had significantly (p < .05) 
lower VIQ than no risk and low risk children, and that low risk chUdren had higher VIQ than those 
with no risk at all. No significant differences were found in PIQ. For ease of interpretation, the interaction 
is displayed in Figure 2. 



Figure 2. The interaction between level of risk and WISC-R IQ. 
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WECHSLER SCALE 



To examine the relationship between risk and specific Wechsler subtests, the data were analyzed 
in a 3 (Risk Level) X 8 (Subtest) mixed repeated measures ANOVA with repeated measures on the 
subtest scores. These analyses were based on the eight subtests routinely administered by the GATE 
department in this district (see Table 3). 



Table 3 

WISC-R subtest scores: descriptive statistics for three levels of risk. 

Entire Sample No Risk Low Risk High Risk 

Subtest (n=5611) («=3804) (n=917) (n=890) 



Information 


14.63 


(2.10) 


14.70 


(2.08) 


1^ 67 


(2.08) 


14.34 


(2.19) 


Similarities 


16.76 


(2.03) 


16.75 


(2.02) 


16.91 


(2.01) 


16.63 


(2.08) 


Arithmetic 


14.88 


(2.19) 


14.89 


(2.20) 


14.85 


(2.23) 


14.85 


(2.10) 


Vocabulary 


15.78 


(2.19) 


15.83 


(2.13) 


15.94 


(2.20) 


15.39 


(2.35) 


Picture Completion 


13.97 


(2.21) 


13.90 


(2.21) 


14.02 


(2.19) 


14.17 


(2.21) 


Picture Arrangement 


14.85 


(2.55) 


14.90 


(2.55) 


14.82 


(2.52) 


14.65 


(2.53) 


Block Design 


15.26 


(2.51) 


15.32 


(2.49) 


15.02 


(2.55) 


15.26 


(2.56) 


Object Assembly 


14.51 


(2.55) 


14.59 


(2.54) 


14.36 


(2.52) 


14.33 


(2.56) 



There was a significant main effect for Risk Level, F(2,5608) = 70.47, p < .001, as weU as a significant 
Subtest effect, F(7, 39256) = 555.00, p < .001. In general, children at high risk had lower scores than those 
at no risk, and the highest scaled scores were in Similarities for all groups while the lowest v/ere in 
Picture Completion. Also significant was the Risk Level X Subtest interaction, F(14, 39256) = 5.91, p < 
.001. Post-hoc Newman-Keuls analyses revealed that children at high risk were significantiy (p < .05) 
disadvantaged in Information and Vocabulary relative to the otlisr two risk levels. However, children 
at high risk had significantly (p < .05) higher scores than those at no risk on one subtest: Pichare 
Completion. Other significant differences included lower scores for high risk children relative to low 
risk children on Similarities and relative to no risk children on Picture Arrangement. Although these 
differences are statistically significant, as can be seen in Table 3, ttiey represent very smaU differences in 
scaled scores and should be interpreted with caution. 

Percentile scores for the DCAT were converted to z-scores and then analyzed in a 3 (Risk Level) 
X 3 (DCAT Score: Verbal, Spatial, Quantitative) mixed repeated measures ANOVA with repeated 
measures on DCAT scores. There was a significant main effect for Risk Level, F(2, 2592) = 44.47, p < .001. 
This effect showed that children at high risk had lower scores, summed across the three subtests, than 
those at no or low risk. The significant main effect for DCAT score, F(2, 5184) = 18.74 p < .001 demonstrated 
that Verbal scores tended to be higher than Quantitative scores, which were higher than Spatial scores. 
The significant DCA; score by Risk Level interaction, F(4, 5184) = 4.59, p < .001, revealed that for those 
at high risk. Quantitative scores were highest, followed by Verbal and then Spatial scores, whereas for 
those at no or low risk Verbal scores were higher than Quantitative. Post-hoc Newman-Keuls 
comparisons demonstrated significant (p < .05) disadvantage for diildren at high risk relative to those at 
low and no risk in all three DCAT scores (see Table 4). 
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Table 4 

DCAT scores: descriptive statistics for three levels of risk. 







Total 
(n=2595) 


No Risk 
(n=987) 


Low Risk 
(n=704) 


High Risk 
(n=904) 


Verbal: 


percentile (SD) 
z-score (SD) 


85.58 (15.09) 
1.35 (0.76) 


87.85 (12.50) 
1.46 (0.72) 


87.49 
1.42 


(12.98) 
(0.71) 


81.61 (18.12) 
1.16 (0.81) 


Quantitative: 


percentile (SD) 
z-score (SD) 


84.99 (16.30) 
1.28 (0.74) 


86.02 (14.46) 
1.32 (0.69) 


85.97 
1.32 


(15.18) 
(0.71) 


83.09 (18.70) 
1.21 (0.80) 


Spatial: 


percentile (SD) 
z-score (SD) 


83.82 (17.46) 
1.23 (0.78) 


85.59 (15.18) 
1.31 (0.73) 


85.00 
1.28 


(16.27) 
(0.75) 


80.97 (20.13) 
1.10 (0.8 j) 



Achievement scores, as measured by the CTBS, were analyzed in a 3 (Risk Level) X 3 (CTBS 
Score: Total Language, Total Reading, Total Math) mixed repeated measures ANOVA with repeated 
measures on the CTBS scores. Again, there was a significant main effect for Risk Level , F(2, 1616) - 
20 45 p < 001 This result again showed that chUdren at high risk achieved lower scores, summed 
across the three areas of achievement, than did those at low or no risk. There was also a significant 
main effect for CTBS score, F(2, 3232) = 221.61, p < .001, with significantly higher Total Math scores than 
Total Language, and significantiy higher Total Language than Total Reading. The mteraction of CTBS 
score and Risk Level, F(4, 3232) = 2.81, p < .05, indicated that children at low risk achieved scores equivalent 
to or lower than those at no risk in Total Reading and Total Math, but sUghtly higher than those at no 
risk in Total Language. Post-hoc Newman-Keuls comparisons revealed that those at high nsk were 
again significantly (p < .05) disadvantaged relative to those at low or no risk . The disadvantage was 
apparent for all three CTBS scores (See Table 5). 



Table 5 

CTBS stanine scores: descriptive statistics for three levels of risk. 







Total 


No Risk 


Low Risk 


High Risk 






(n=1619) 


(n=662) 


(n=388) 


(n=569) 


Total Language 


M (SD) 


7.65 (1.14) 


7.74 (1.07) 


7.78 (1.06) 


7.46 (1.24) 


Total Reading 


M (SD) 


7.43 (1.19) 


7.56 (1.06) 


7.56 (1.16) 


7.19 (1.31) 


Total Math 


M (SD) 


8.16 (1.09) 


8.22 (1.07) 


8.20 (1.07) 


8.07 (1.12) 



Discussion 



The present shidy compared intellechially gifted children at three levek of identified risk: no 
risk, low risk (one area of risk) and high risk (two or more areas of risk). These children were evaluated 
in six areas of risk: cultural, economic, emotional, enviromnental, health, and language. Eveiychild in 
the shidy had been referred as potentiaUy gifted and had subsequently achieved either a WISC-R FuU 
Scale IQ score of at least 130, or a Raven Standard Progressive Matrices score at least two standard 
deviations above the mean. Therefore IQ was tightly controlled. Every child m the study had 
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demonstrated high intellectual potential despite the presence of varying levels of risk for failure to 
fully express that potential. This design allowed us to evaluate the effect of risk without confounding 

IQ. 

The data support our hypothesis of a different pattern of scores in children at risk. This pattern 
is characterized by depressed verbal abilities. In general, children at high risk were found to be 
disadvantaged relative to those at low or no risk in all measures of both aptitude and achievement. 
Furthermore, those at high risk demonstrated lower \VISC-R Verbal IQ than those at lower levels of 
risk. The disadvantage in Verbal IQ was particularly evident in the Information and Vocabulary subtests. 
Thus our findings support the hypothesis that high levels of risk negatively impact expression of a 
gifted child's intellectual potential 

These data are not surprising. Low scores on Information and Vocabulary have long been 
known to be associated with risk factors such as poor reading ability, environmental deprivation, and 
emotional problems (Guertin, Ladd, Frank, Rabin, & Hiesler, 1971; Lewandowski & Saccuzzo, 1976; 
Saccuzzo & Lewandowski, 1976). The present study is unique in showing that Information and 
Vocabulary are depressed even while IQ is controUed and even for inteUectuaUy gifted children. 

The consistency of the low scores on Information and Vocabulary for children at risk across a 
variety of studies strongly points to the possible diagnostic significance of these subtests. Certainly, 
when evaluating tor giftedness, practitioners should be alert to the presence of risk factors whenever 
these scores are depressed. 

Other findings from the study should be interpreted more cautiously The finding of relatively 
high scores on Picture Completion needs to be considered in light of the standards promulgated by 
Lewandowski & Saccuzzo (1976). As these investigators noted, in order for a specific test sign to have 
reliable diagnostic significance, a number of methodological standards must be met. These include 
control of all major relevant variables, sufficient power, and cross-validation. 

The present study, while meeting the standards for control and power, did not include a cross- 
validation. Thus, before we accept the diagnostic significance of the Picture Completion subtest as a 
marker of risk, it would be important that the present findings be cross-validated on an independent 
sample. 

From the standpoint of the education of children at risk, a number of factors should be 
considered. First, despite their high IQ's, these children have a relatively depressed vocabulary. Teachers 
of gifted children at risk might, therefore, concentrate on enriching the vocabularies of these chUdren. 
The low score on the Information subtest may have several implications. It could be that these children 
do not have the same opportunity to acquire everyday factual information as do children not at risk. 
Alternatively, these children, due to such factors as economic disadvantage and environmental distress, 
do not have the same interest in acquiring factual infonnation as do more advantaged children. It 
would seem important that teachers of the high risk gifted attempt to bring relevance to the educational 
process for these cl'dldren. 

The results show a surprisingly high concentration of risk for Latino, African- American, and 
FUipino children. Practically 80 percent of the Utino, 60 percent of the African-American, and 70 percent 
of the FiUpino children had at least one risk factor. These findings reveal that individuals from these 
groups, even though gifted, have a highi potential for risk. Even more surprising was the finding that 
nearly 80 percent Indochinese had two or more risk factors. Educators working with these populations 
need to be sensitized to the high level of adversity these children face. It should be emphasized that 
these children, in spite of their adversity, still managed to score two standard deviations above the 
mean on a standardized IQ test. Children within tiiese groups who are not as intellectually capable 
clearly are far more vulnerable to the varieties of risk that they must deal with on a daily basis. 

There were some important correlations among the various areas of risk. The strong association 
between cultural and language risk is not surprising. Children who come from nonEnglish-speaking 
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homes, more often than not, come from a different culture. Another important area of association was 
found among environmental, economic, and emotional risk. Although cause-effect relationships cannot 
be discerned from these data, it is easy to see how these risk factors are associated. Teachers need to be 
alert to the association among these three areas of risk. 

The sample for this study was drawn from a population of children referred as potentially 
gifted. Therefore, it does not constitute a truly random sample of children two standard deviations 
above the mean in intelligence test scores. However, the sample does accurately represent children who 
have been identified and certified gifted based on attaiiunent of standardized intelligence test scores 
two standard deviations above the mean in a diverse school district that has come extremely close to 
achieving ethnic and cultural equity in its gifted education program. It represents children who have 
demor\strated their intellectual potential in an objective manner, despite varying levels of risk. Therefore, 
this sample powerfully illustrates the impact of risk factors on the expression of potential for gifted 
children. 
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Abstract 



Test patterns and the role of risk factors were investigated for a large 
sample (5004) of children from an ethnicaUy diverse school district. All children 
had IQ equivalents of 130 or greater on a major standardized intelligence test. 
ChUdren with no risk factors were compared to children with a language/ 
culture risk factor, and to a two-risk group of children with a language/ cultural 
risk factor plus an additional risk factor. Children with language /culture risk 
showed their highest scores on nonverbal tests, including quantitative aptitude 
and mathematics achievement, but had significantly (p < .05) depressed scores, 
compared to the no risk group, in aU verbal subtest areas. The data support 
Cummins' subtractive hypothesis, which states that bilingualism hindeis the 
expression of a child's abilities until the child is completely competent in both 
languages. 
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Numerous studies support the hypothesis that being bilingual enhances a person's performance 
on a variety of cognitive tests (Ben-Zeev, 1977; Cummins, 1976, 1978, 1981, 1986, 1989; Cummins & 
Gulutsan, 1974; Dash & Misahra, 1988; Diaz, 1983; Feldman & Shen, 1971; Gallegos & Franco, 1985; 
Hakuta, 1987; Hakuta & Diaz, 1985; Hakuta & Garcia, 1989; lanco-WorraU, 1972; Landry, 1974; Lindholm, 
1991; Peal & Lambert, 1962; Zentella, 1981). In all of these studies bilingualism has been defined as 
adding a second language to an already well-developed language, or acquiring a second language to 
replace the first language. Cummins' (1976) threshold hypothesis best explains the enhanced 
performance of bilingual individuals by suggesting that once an unspecified threshold point of fluency 
in both languages is reached, the languages complement each other and add to the bilingual person's 
ability to perceive the environment from multiple cultural viewpoints. Theoretically, tliose who are 
developing bilinguaUy are constantly monitoring and controlling two symbol systems that could possibly 
interfere with each other. AdditionaUy, comparing two languages may increase a person's ability to 
work with abstract ideas, such as language or quantitative tasks, that are commonly referred to as 
"metacomponential abilities" (Dash & Mishra, 1988; Reynolds, 1991). Until a threshold point of 
bilingualism is readied, however, the process of learning and being evaluated in a second language 
can only be subtractive, thus hindering the complete expression of one's abilities. 

Often, for minority language children in the American school system, neither English or the 
native language is adequately developed because native speakers of another language are immersed 
into classrooms where only English is spoken. The child's primary language is not used in the classroom; 
thus EngUsh is used only in a formal setting. The result is that children do not identify with either 
language and often speak a combination of the two; for example, "spangiish" for bilingual Spanish/ 
English speaking chUdren (Cauce & Jacobson, 1980). Teachers may erroneously assume that if a student 
is proficient in conversational English with peers, he or she is also English proficient in the classroom. 
Classroom proficiency, however, requires more than simply having an understanding of the language 
(Cummins, 1981). Cummins (1981) points out that parents may be encouraged by teachers or peers to 
speak to their children in EngUsh to help them obtain proficiency. However, since the parents are often 
not fluent, the child is at best exposed to poor English. At worst, parents may be 'uncomfortable speakmg 
in a foreign language and consequently may not communicate with their children as much. Thus, 
having a native language other than EngUsh is a risk factor in American classrooms. 

There is a cultural component coupled with any language risk factor a child may bring into the 
school system. A theory by Vygotsky (1962) states that language is initiaUy used as a form of social 
communication and eventuaUy evolves into a way of ordering thoughts and working with abstract 
symbol systems. Ln essence, speech and thought are not the same thing. Speech is simply one form of 
a symboUc thought process that humans use to communicate ideas (Ben-Zeev, 1977; Peal & Lambert, 
1962; Vygotsky, 1962). Each society is in fact limited by its language because it can only express those 
thoughts, feeUngs, and experiences that the language has words to express. Therefore, learning a 
language is a sociaUzation process that affects how one perceives the environment (Vygotsky, 1962). 
One's perception of the environment is often referred to as a cognitive set. TheoreticaUy, a person who 
is biUngual may have a broader cognitive set, plus more cultural viewpoints from wWch to view any 
new experience, and thus may accumulate more crystaUized inteUigence (CatteU, 1963). 

Becoming bilingual may also act as a disadvantage for children in terms of the testing measures 
traditionaUy used to assess inteUigence (e.g., the WISC-R, WISC-HI, or the Stanford-Binet Intelligence 
Scales) because most of the evaluations are performed in English, the minority language child's second 
language The measures are additionally biased in favor of the Anglo American cognitive set and 
culhire upon which they were based (Bemal, 1974; Melesky, 1985). As a result, poor scores are often 
obtained and may be interpreted as a learning disability or lack of subject knowledge, instead of a 
demonstration of the level of second language fluency and cultural assimUation the child has attamed 
(Ascher, 1990; Cummins, 1982; Ortiz, 1991). 



9R 



95' 



The present study evaluated a large sample of gifted children who had either no risk factors, a 
language/culture risk factor, or language/culture and an additional risk factor. Their aptitude and 
achievement scores were examined and compared in order to answer a number of basic questions 
pertaining to children with language as a risk factor. We hypotliesized that there would be a different 
pattern of test scores as a function of risk level for children with language risk. Since the bilingual 
children in this study were not fully competent in both languages, Cummins' subtractive hypothesis 
would be operative and result in relatively impaired performance. 

Method 

Subjects 

The subjects consisted of 5004 children who had been certified as gifted based on high intellectual 
functioning (i.e., IQ equivalent > 130) in the San Diego City School district between 1984 and 1990. IQ 
was measured with either the Weschler Intelligence Scale for Children-Revised or the Raven Standard 
Progressive Matrices Test. This was an ethnically diverse sample consisting of 295 Latino/Hispanic, 
3985 Caucasian, 170 African-American, 350 Asian, and 204 Filipino children. Of these, 2,362 (47.2 
percent) were female, 2,642 (52.8 percent) male. Children were distributed by grade as follows: 146 
first, 2218 second, 1085 third, 535 fourth, 382 fifth, 347 sixth, 143 seventh, 73 eighth, 61 ninth, and 14 
unknown. 

The sample was divided into three groups based on risk: no risk, a language/culture risk, or 
two risks (a language/culture risk factor coupled with one additional risk factor). Figure 1 shows the 
percentage of children at various levels of risk. Of the sample of 5004 children, 655 (13.0 percent) had 
language/culture as a risk factor. Of these 655 children, 71 (10.8 percent) had an economic risk factor, 
91 (13.9 percent) had an environmental risk factor, 88 (13.4 percent) had emotional risk factor, and 31 
(4.7 percent) had a health risk factor. 

Figure 1: Percentage of children with no risk, a language/ 
culture risk, or two risks, compared to their 
percentages in the sample as a whole. 



Figure 1 shows the 
percentage of children 
with no risk, a language/ 
culture risk, or two risks 
compared to their 
percentages in the sample 
as a whole. While Latino/ 
Hispanics represented 
about 6% of the total 
gifted sample, this group 
comprised about 23% of 
the children at language/ 
cultural risk. 




Q No Risk 

■ Lang./Culture Risk 

■ Lang./Culture Plus Another Risk 
□ Total Sample 
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procedure: 



Each subject was referred for giftedness testing at San Diego City Schools by a parent, teacher, 
or through central nominations. Each was then individually evaluated with either the Wechsler 
Intelligence Scale for Children - Revised (WISC-R), or the standard form of the Raven Progressive 
Matrices (RPM) Test. In addition, each child was given the Developing Cognitive AbiHties Test (DCAT). 
The DCAT is an aptitude test designed to predict academic achievement in verbal, quantitative, and 
spatial domains. To determine qualification for giftedness, a school psychologist then conducted a 
case study evaluation of each chUd using information from the individual evaluation, the Comprehensive 
Test of Basic Skills (CTBS; a standardized achievement test), and a consideration of risk. Risk was 
determined through a self-report questionnaire sent to the home and/or through a questionnaire 
completed by the teacher about the chUd. The five categories of risk were: language /culture (e.g. 
primary language other than English spoken in the home), economic (e.g. low income), emotional (e.g. 
death, divorce, seeking psychological services), health (e.g. physical disability, asthma), and 
environmental (e.g. frequent moves, academically unenriched home environment). 



Results 

WISC-R Verbal IQ (VIQ) and Performance IQ (PIQ) were analyzed in a 3 (Risk Level) X 2 (IQ 
Score) mixed repeated measures ANOVA with repeated measures on IQ Score. The significant main 
effect for IQ Score, F (1, 4338) = 5.71, p < .05, indicated that VIQ (M = 136.02, SD = 8.72) was significantly 
higher than PIQ (M = 132.72, SD = 9.02). 

These results are qualified, however, by a significant Risk X IQ Score interaction, F (2, 4338) = 
14 10 p < 001,illustrated in Figure 2. As confirmed by Newman Keul's post-hoc multiple comparisons, 
chUdren with no risk had significantly higher VIQ scores (M = 136.17, SD = 8.61) than children with 
only a Language / Culture risk factor (M = 134.52, SD = 10.10; p < .05) or chUdren with two nsk factors 
(M = 133.94, SD = 9.09; p < .05). The PIQ scores, however, evidenced a reverse pattern in that PIQ 
scores were significantly lower (p < .05^ for children with no risk factors (M = 132.56, SD = 9.05) than 
children with only a language/culture risk factor (M = 134.67, SD = 8.89). Lastly a significant difference 
between VIQ and PIQ (p < .05) was found for children v/ith no risk factors. 

Figure 2: Risk level by WISC- R IQ score interaction. 
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Data for the DCAT were originally collected as percentile scores. These scores were converted 
to Z-scores and analyzed in a 3 (Risk Level) X 3 (DCAT Score: Verbal Spatial, Quantitative) repeated 
measures ANOVA with repeated measures on IX AT Score. For ease of comprehension means, standard 
deviations, and figures are reported in Table 1 in terms of percentiles. There were significant main 
effects for Risk Level, F (2, 1340) = 20.78, p < .001, and DCAT Score, F (2, 2680) = 7.44, p < .001. Newman 
Keul's post-hoc comparisons showed significant differences between no risk children and those with a 
language /culture risk (p < .01) or a two risks (p < .01). Comparisons for the main effect of DCAT scores 
indicated that the spatial subtest scores were significantly lower (p < .05) than both the verbal and 
quantitative subtests. 



Figure 3: Risk level by DCAT subtest score interaction. 
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The Risk X DCAT Score interaction was also significant, F (6, 2680) = 9.09, p < .001. Figure 3 
illustrates this effect. As confirmed by Newman-Keuls post hoc multiple comparisons, children with 
two risks were impaired on all three DCAT measures of academic aptitude relative to children with no 
risk factors (p < .05). AdditionaUy, those with only language/culhire risk were significantly lower in 
verbal and spatial aptihide than the no risk group (p < .05). Lastly, tlie language/culhire risk group 
scored significantly higher in spatial aptitude (p < .05) than those with two risks. 



Table 1 

Means and Standard Deviations ofZ-Scores and Percentiles for the DCAT 





DCAT Verbal 


DCAT Spatial 


DCAT Qualitative 


Marginals 




Z 
(SD) 


(SD) 


z 

(SD) 


Pprcpntilp 
(SD) 


z 

(SD) 


Percentile 
(SD) 


Z 
(SD) 


Percentile 
(SD) 


No Risk* 


1.39 
(0.78) 


87.56 
(12.90) 


1.23 
(0.76) 


85.20 
(15.87) 


1.28 
(0.70) 


85.95 
(14.69) 


1.30 
(0.73) 


86.24 
(14.48) 


Ljing/Cult 
Risk** 


0.97 
(0.81) 


80.71 
(18.43) 


1.03 
(0.86) 


83.50 
(18.05) 


1.24 
(0.77) 


86.57 
(15.72) 


1.08 
(0.81) 


83.58 
(17.40) 


Lang/Cult + 0.89 
Another Risk*** (0.88) 


78.37 
(20.65) 


0.88 
(0.89) 


78.79 
(21.80) 


1.14 
(0.79) 


83.11 
(20.44) 


• 0.97 
: (0.85) 


80.09 
(20.96) 


Marginals 


1.35 
(0.78) 


85.57 
(15.21) 


1.25 
(0.77) 


84.24 
(17.06) 


1.32 
(0.73) 


85.71 
(15.61) 





* n = 1331 
« = 305 
n = 329 



For those with no risk factors, verbal scores were significantly higher than spatial (p <.01) and 
quantitative scores. Children with a language/culture risk factor demonstrated the same pattern as 
childrcn with two risk factors in that verbal scores for both groups were significantly depressed compared 
to their quantitative scores (p < .05). Additionally, for the group with two risk factors, quantitative 
scores were sigiuficantly higher than spatial scores. 

Data for CTBS Scores were analyzed in a 3 (Risk Level) X 3 (CTBS Score: Total Language, Total 
Reading, Total Math) repeated measures ANOVA with repeated measures on CTBS Score. As with the 
previous ANOVA's, there were significant main effects for Risk Level, F (2, 879) = 9.44, p < .001, and 
CTBS Score, F (2, 1758) = 103.16, p < .001. The Risk Uvel by CTBS Score interaction was also significant, 
F(4, 1758) =3.49,p<.0L 

The main effect for Risk Level showed that children with no risk factors (M = 7.89, SD = 1.05) 
performed at a significantly higher level (p < .05) than those with two risk factors (M = 7.56, SD = 1.21). 
The main effect for CTBS showed that CTBS Total Math Scores (M = 8.25, SD = 1.05) were significantly 
higher (p < .01) than both Total Reading (M = 7.52, SD = 1.08) and Total Language scores (M = 7.68, 
SD = 1.10). 
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Figure 4: Risk level by CTBS subtest score interaction. 
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Figure 4 illustrates the Risk Level X CTBS Score interaction. As confirmed by Newman-Keuls 
post hoc multiple comparisons, both language (M = 7.45, SD = 1.20) and reading subtest scores (M = 
7 38 SD = 1.08) were significantly (p < .05) depressed for the laiiguage/culture risk factor group when 
cotr^pared to the no risk group (language: M = 7.74, SD = 1.07; reading: M = 7.59, SD = 1.05). ChUdren 
with two risk factors had significantly (p < .05) reduced scores in all three areas of achievement (language: 
M = 7.48, SD = 1.19; reading: M = 7.07, SD = 1.36; math: M = 8.08, SD = 1.08) when compared to the no 
risk group (math: M = 8.27, SD = 1.05). Children in the no risk group and the two risk group differed 
significantly in all three areas of achievement {p < .01). Math achievement was the highest, followed by 
language, then reading achievement scores. For children in the language/culture group, math scores 
were sigriificantly higher than the language and reading scores {p < .01). 



Discussion 
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We evaluated a large sample of gifted children across three levels of risk: no risk, language/ 
culture risk only, and two risk factors (as evidenced by a language/culture risk plus one additional risk 
factor). The children were compared in terms of their pattern of intelligence test scores (i.e., for the VIQ 
and PIQ of the WISC-R), academic aptitude (DCAT), and academic achievement (CTBS). 

Children with only language /culture as a risk factor consistently showed disadvantage, when 
compared to children with no risk, in verbal domains as opposed to nonverbal. These children showed 
sUghtly lower Verbal IQ scores and sUghtly higher Performance IQ scores than the no risk group. They 
additionally demonstrated their highest aptitude and achievement scores in the quantitative/math 
area while performing below the no risk group in areas of verbal aptitude, language achievement, and 
reading achievement. From these data, it appears that children who are in the process of becommg 
bilingual demonstrate their giftedness through performance based tasks, such as quantitative/math 
type skills, in order to compensate for their verbal disadvantage. This finding is consistent with the 
theory that children in the process of becoming bilingual practice working with abstract symbol systems 
because they must switch language codes depending upon the given situation pash & Mishra, 1988; 
Reynolds, 1991). Consequently, these children have an advantage in the quantitahve/math area, smce 
the study of these subjects is simply another abstract symbol system. 
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Children with two risk factors demonstrated the same pattern of test scores as the language/ 
culture risk children. The results are more devastating for the high risk group, however, because their 
scores were even more depressed compared to the no risk children. 

Overall, the data indicate that language/culture acts as a significant risk factor for those children 
evaluated for gifted education. Because of limited English skills when tested, these children may often 
be excluded from programs in which they would excel if they were competent in classroom English. 
These data support Cummin's (1976) subtractive hypothesis that while these children are becoming 
competent in English, their bilingualism hinders the complete expression of their abilities. The language/ 
culture risk children who were certified gifted with the WISC-R in this sample might actually be expected 
to show improvements in their IQ scores once English proficiency is obtained, given the generally high 
Performance IQs. There m.ay be an even larger number of children who have been excluded from 
gifted education because their Performance IQs could not compensate for their current level of Verbal 
IQ performance, when verbal tests such as the WISC-R are used as the criterion for giftedness. 

Children who are not native English speakers in the U.S. are often at a disadvantage in the 
educational system because o: their lack of verbal proficiency Present data indicate, however, that the 
strengths of these children can be found in nonverbal areas. The data also highlight the special needs 
and vulneral ^ilities that children with language risk have in all areas of verbal ability. Are these strengths 
and weaknesses only indicative of an English language deficiency, or are there additional variables, 
perhaps involving cultural discrepancies, that account for the language/culture risk group's patterns 
of test results? Once flu^^ncy is obtained in English for these at risk children, do their test patterns more 
closely resemble these of no risk children? Further research is needed to shed light on these questions. 
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Abstract 



One hundred and sixty children were evaluated in a battery of four 
iiiformation-'processing tasks: Inspection Tune (backward masking pcuradigm). 
Reaction Time, Coincidence Tuning, and Mental Counters (Working Memory). 
Half of the children were certified as gifted in a case study analysis; half were 
selected from the nongifted program in the same school district. Within each 
group (gifted vs. nongifted), half were in the 2nd-3rd grade, half in the 5th- 
6th grade. Finally, for each of the two main factors (giftedness and grade), 
there were an equal number of children from four ethnic backgrounds: 
African-American, Lalmo, Filipino, and White. There were large differences 
on all four information-processing tasks as a function of grade and 
membership in the gifted program. Only one significant interaction occurred 
involving ethruc background, in which gifted African- Americans showed the 
fastest RT's and nongifted African- Americans the slowest. Regression analysis 
revealed that measures of speed of processing, particularly Ir\spection Tmie, 
were the primary correlates of both IQ and membership in tiie gifted program. 
Overall, however, the relationship between the measures of processing and 
IQ were modest. Implications of these findings are discussed. 
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Information-Processing in Gifted vs. Nongifted African-American, Latino, Filipino, and Wl\ite 
Children: Speeded vs. Nonspeeded Paradigms 



A relatively little-used nontraditional method of selecting children from diverse backgrounds 
for gifted programs involves the analysis of information-processing abilities (Grinder, 1985; Sternberg, 
1981). As Horowitz and O'Brien (1986) noted, "If different subcultures in the U. S. foster different styles 
of thinking on different strategies of information processing, then it should be possible to identify and 
describe these for each population" (p. 1148). Alternately, measures of information-processing may 
provide an vmbiased method of selecting for giftedness. 

Wagner and Sternberg (1984) identified information-processing as one of three main approaches 
to the concept of intelligence. The other two were the psychometric approach, which uses traditional 
standardized tests, and the Piagetian approach, which is based on Piaget's theory of cognitive 
development. In the information-processing approach, researchers attempt to analyze responses m 
terms of the basic component processes that underUe them. For example, information-processing begins 
initiaUy with input of external stimulation. This input is then stored or held in a short-term storage or 
working memory system while analytical processes are performed. The results of this analysis are 
subsequently transferred to otlier systems, such as long-term memory, where new incoming information 
can be compared to one's present store of knowledge so that an appropriate response can be made. 
Theoretically, faster or more efficient information processors are better able to learn and to solve problems. 
Indeed, reviews of an extensive Uterahire have supported the view that the speed or efficiency with 
which an individual can execute a smaU number of basic cognitive processes is highly related to one's 
performance on psychometric tests of inteUigence such as the Wechsler IntelUgence Scaies and Raven 
Progressive Matrices Test Qensen, 1982; Jensen & Reed, 1990; Urson & Rimland, 1984; Vernon, 1987; 
Vernon, Nador, & Kantor, 1985). 

Thus far, two main variations of the information-processing approach have been advanced. 
Sternberg's (1981) theory emphasizes complex processes— "metacomponential" or executive skills, such 
as problem recognition, process selection, strategy selection, and solution monitoring. Sternberg's theory 
stresses the role of the ability to make inferences and apply previously made inferences to new domams, 
and of learning skills such as encoding and retrieval of information from long-term memory storage 
(Sternberg & Davidson, 1983). Although promising, this approach presently lacks a standard and widely 
accepted set of tasks to evaluate the various stages of processing. In addition, many of the skills are 
highly dependent on vorbal ability, which may make them less suitable in selecting disadvantaged 
child ien. 

A second infom\ation-processing approach, the one evaluated in the present study, attempts to 
tap into a basic ability that theoreticaUy underUes performance on more complex tasks through the use 
of elementary cognitive tasks (Hunt, 1978; Jensen, 1982, 1987) that evaluate speed of informahon- 
processing. According to this view, gifted children are faster in their ability to encode and manipulate 
environmental input and to retrieve and analyze existing knowledge. Consistent with this speed of 
processing theory, Saccuzzo, Larson, and Rimland (1986) found that several measures of visual and 
auditory speed of processing, which contained Uttle or no complex problem solving skills and required 
minimal language skills (only the ability to understand instructions) shared significant common yanance 
with conventional standardized psychometric tests that did contain a high degree of inteUechial content 
and involved complex problem solving. 
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Empirical support for a relationship between processing speed and individual 
inteUigence has come from reaction time shidies that manipulate the level of unccrtn.nt;; to whid^a 
Tub^ct must respond Qensen, 1979; Jensen & Munro, 1979; Jensen & Reed, 1990; ^u-nebor^ 1978; 
Smith & Stanley, 1983; Vernon, 1981). Using parameters such as median reaction time, slope of reaction 
time as a function of the number of bits, and intraindividual standard deviations of reaction time 
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performance, irwestigators have reported large differences between retarded persons and those of 
normal IQ, as well as between vocational-coUege students and university students Qensen 1980; 1982). 
Based on his own findings and a survey of the Uterature, Jensen (1982) estimated the correlation between 
reaction time and individual differences on IQ tests to be between -.3 and -.4. The correlations vary 
widely across samples, however (Limneborg, 1978). 

A number of investigators have found support for a relationship between speed of processing 
and giftedness (Span & Overtoom-Corsmit, 1986). Cohn, Carlson, and Jensen (1985), for example, 
found that gifted children differed fundamentaUy from average children in their speed of information- 
processing as evaluated in a reaction time paradigm. Cohn et al. (1985) compared a group of "bnght- 
average" 7th-grade children to a group of academically gifted children of coniparable age who were 
taking coUege-level courses in mathematics and science. Large and significant group differences were 
found on each of nine elementary reaction-time measures of speed of information-processing. 

A second line of investigation, the study of speed of visual information-processing, also has 
supported a relationship between processing speed and performance on complex cognitive tasks. In 
the typical visual paradigm, the subject makes a discrimination for a briefly exposed "target" stimulus, 
such as identifying which of two lines presented to the right and left of cenb-al fixation is longer. The 
target stimulus is followed by a spatiaUy overlapping non-informational mask ( e.g., a umform line 
that completely superimposes the lines of the target stimulus). An extensive Uterature on the masking 
task itself reveals that it limits the duration that the informational impulse provided by the target is 
available for processing in the central nervous system (Felsten & Wasserman, 1980). Speed of processing, 
or "Inspection Time" as it is usually called (Vickers, Nettelbeck, & Willson, 197?), is estimated by either 
systematicaUy varying the exposure duration of the target and estimating the minimum duration needed 
for criterion accuracy (Lally & Nettlebeck, 1977; Nettelbeck & LaUy 1976), or by keeping the stimulus 
duration constant and varying the interval between the target and mask (Saccuzzo, Kerr, Marcus, & 
Brown, 1979; Saccuzzo & Marcus, 1983). 

Numerous studies have reported a statisticaUy significant difference between mentally retarded 
and non-retarded (average IQ) individuals in inspection time as evaluated in a backward masking 
paradigm. Such differences occur in spite of wide variations in the nahire of ihe stimulus, method of 
stimulus presentation, and technique used to estimate visual processing speed (Saccuzzo & Michael, 
1984). There are, moreover, clear<ut developmental differences. The general finding is a direct 
relationship between chronological as weU as mental age and performance (Blake, 1974; Liss & Haith, 
1970; Saccuzzo et al., 1979). Finally the evidence supports a significant relationship between degrees 
of normal intelligence and visual processing speed; however, ihe magnihide of the relationship remains 
controversial (Mackintosh, 1981; Nettelbeck, 1982). 

Though an eariy study reported an astonishing -.92 correlation between scores on the 
Performance Scale of the Wechsler Adult Intelligence Scale (WAIS) and Inspection Time (Nettelbeck & 
Lally 1976), most subsequent investigators found a less spectacular, but significant, relationship between 
Inspection Time and intelligence, with a median correlation of about -.45. These positive findings have 
been criticized, however, on metiiodological grounds-«mall sample sizes (usually no more than 25 
subjects); the inclusion of mentally retarded persons, which greatly inflates the correlation due to the 
exb-emely disparate range of performance relative to the sample size; and analyses based only on 
extreme scoring subjects, which, again, is weU-known to inflate correlations (Irwin, 1984; Nettelbeck, 
1982). 

Nettelbeck (1982) took a carehil look at his own and others' work in tl^ie area. Nettelbeck's 
analysis revealed a relatively small but consistent association between inteUigence and Inspection Time. 
Irwin (1984) similarly found modest but significant correlations between Inspection Time and mtelligence 
test performance. More recently, Nettisbeck (1987) provided an estimate of -.50 as the relationship 
between Inspection Tune (IT) and intelligence. This estimate was subsequently confirmed by Krai^zler 
and Jensen (1989) in a meta-analysis. Based on an extensive Uterahire review and meta-analytic 
procedures, Kranzler and Jensen estimated that for adults, with general measures of IQ, the IT- 
intelligence correlation is about -.54 after correction for tiie effects of artifactual sources of error, and 



-.30 prior to correction. Despite these promising findings, more work is needed to determine if gifted 
children can be distinguished from nongift ■:" children on IT tasks, and whether they can be so 
distinguished in an unbiased maimer. 

A few studies have attempted to examine racial differences in speed of information processing. 
In a reaction time study, Lynn and Holmshaw (1990) compared 350 black Soutii African 9-year-old 
children with 239 white British children on 12 reaction time tests. While tiie black children had slower 
decision times and greater variability than the white children, there were also bemendous IQ differences 
between the groups. The black children's mean Raven IQ corresponded to tiie first percentile and was 
equivalent to an IQ of about 65. The white children, by contiast, were in the 56tii percentile, wi\h an 
equivalent Raven IQ of 102. Because the black sample in Uns shidy was lower in IQ than is generally 
found for U.S. samples, these results have Uttie, if any, generalizabiUty to American black and white 
populations. 

Astiidy of racial differences in a backward masking paradigm is similarly limited. Bosco (1972) 
compared \he performance of first- and sixth-grade black and white school children. There were clear 
differences in socioeconomic status between the whites, who were selected from a suburban area, and 
the blacks, who were selected from ihe inner city. Since race and socioeconomic background were 
confounded, the issue of \he relatior\ship between race and IT was imresolved. 

To date stiidies of information-processing and intelligence have focused on speed, with relatively 
Uttie attention given to other information-processing tasks that might also underUe intelUgent behavior. 
One such task is coincidence timing (CT), a task tiiat requires subjects to respond at the mstant two 
objects intersect or "coincide" (Dunham, 1977; Poulton, 1950). 

Smittx and McPhee (1987) traced tiie history of coincidence timing (Dorfman, 1977; Poulton, 
1950- Thomas, GaUagher, & Purvis, 1981). As Smith and McPhee noted, coincidence timing relates to 
such' everyday tasks as stepping on and off escalators, picking up an object on a conveyor belt, and 
predicting when one's chai.ging of lanes on the freeway will coincide with a gap m tiaffic. In more 
primitive societies, coincidence timing also had survival value, as in predicting where to aim a spear to 
hit a moving animal. Coincidence timing tasks require subjects to attend to changing conditions, mtegrate 
information over time, and use that information to predict a fuhire event (Smith & McPhee, 1987). 

Smith and McPhee conducted the first published attempt to determine if a correlation sxiste 
between psychometric intelligence, as evaluated by Hve Standard Raven Progressive Matnces Test, and 
a coincidence timing task. These investigators administered a 10 minute CT task to 56 males and females 
of "high" to "moderate to high" socioeconomic stahis. Subjects were required to press a key at the very 
moment a moving target touched (coincided with) a stationary line. There was a significant negative 
correlation (- 294) between ttxe number of errors on \he CT task and Raven scores. In addition, there 
was a significant negative correlation (-.359) between intrasubject standard deviation (consistency of 
performance) and Raven scores. 

As Larson (1989) noted, the correlation between Raven scores and coincidence timing adds a 
new dimension to the well-known correlation between psychometric intelligence and information- 
processing tasks in that, unlike previous tasks such as reaction time, coincidence timing does not requ^ 
speed of processing, but rather attention and estimation. Thus, ttxe task has potential for addmg to ttie 
ranee of relatively simple tasksdevoid of inteUectualcontentthatmay be rektedto,andp^^^^ 
tSf involvingcomplex problem solvingsuchas the Raven. Larson(1989)confinnedSmittx and 
finding of a significant relationship between \he CT task and psychometnc mteUigence as measured 
by tS Armed Forces Qualifying Test (AFQT), in a group of 127 male Navy recruits. To date however 
it has yet to be determined if a coincidence timing task can distinguish gifted and nongifted children, or 
whether there are ethnic differences in this skill. 

A question raised by Larson (1989) is whetixer some variable might underUe performance on 
reaction time, inspection time, and coincidence timing. One such common variable, accordmg to Larson 
may be working memory-ttxe hypothetical cognitive work space for problem solvmg. As Larson 
(1989) noted working memory provides a theoretical bridge between simple cogmtive tasks and 



psychometric tests, based on concepts such as "representational agility and/or fidelity" (p. 366). In the 
present study, we attempted to provide a direct test of Larson's theoretical bridge hypothesis through 
the use of a microcomputerized task of working memory called mental counters (Larson, 1986). 

Method 

Subjects: 

Eighty children who had been certified as gifted by a school psychologist were compared to a 
matched sample of eighty nongifted children. For each of these two samples (gifted avA nongifted) 
there were forty 2nd- to 3rd-grade children and forty 5th- to 6th-grade children. Each of the four 
subgroups of forty children had 10 African-American, 10 Filipino, 10 Latino/Hispanic, and 10 White 
children. The nongifted children were matched to the gifted children on the basis of age, race, and 
school district. 

Procedure: 

Giftedness was determined individually for each child by a school psychologist in a 
comprehensive case study analysis. This analysis considered recommendations by parents or teachers, 
a behavior checklist, achievement, standardized tests scores, and the presence of risk factors including 
economic disadvantage, cultural differences, English as a second language, and negative environments. 
Each child was given a battery of microcomputerized tests as follows: Inspection Time (IT), Choice 
Reaction Time (CRT), Coincidence Timing (CT), and Mental Counters (MC). These tests were presented 
on an IBM PC/XT microcomputer with a black and white monitor and standard keyboard. The tests 
were admirdstered in counterbalanced order in one session, which lasted approximately one hour. 
Subjects were administered a standard Raven in a separate, second session. All subjects, and their 
parents, provided voluntary written informed consent for their participation. Nevertheless, one school 
district refused to allow the administration of the Raven; 18 children (11 gifted) did not receive the 
Raven. The specific parameters for each task in the information-processing battery are described below. 

Inspection Time C/T). The inspection time (IT) task was a non-adaptive procedure based on the 
methods of Larson and Rimland (1984) and Saccuzzo and Larson (1987) and described in detail by 
Larson and Saccuzzo (1989). A target stimulus consisting of two horizontal lines of unequal length (17.5 
mm and 14.3 mm) was briefly presented in the center of the computer monitor. The two lines appeared 
to the right and left of central fixation, with the longer appearing right or left on a random basis. 
Immediately following termination of the target, a backward visual noise mask, consisting of a single 
horizontal line tl\at completely superimposed spatially on the target, was presented. The mask is known 
to limit the duration of the sensory signal delivered to the central nervous system by the target (Felsten 
& Wasserman, 1980). Targets were presented at one of five completely randomized stimulus durations: 
16.7, 33.4, 66.8, 102.2, and 150.3 msec, which corresponded to 1, 2, 4, 6, and 9 refresh cycles on the video 
monitor. There were 10 trials per stimulus duration, for a total of 50 trials. The subject's task was to 
make a forced-choice discrimination, indicating which of the two lines of the target is longer, by pressing 
one of two keys on the microcomputer keyboard. The task began with a set of instructions, examples, 
and practice to criterion prior to the test proper. Subjects were given computer-generated visual feedback 
on their performance. 

Choice Reaction Time (Hick Paradigm). A Hick paradigm for a 1,3, and 5 choice reaction time task, 
as described by Saccuzzo et al. (1986) and Larson and Saccuzzo (1989), was used to evaluate choice 
reaction time performance. A horizontal arrangement of lights was presented at the bottom of the 
monitor. All subjects were presented with l-,3-, and 5-choice conditions, with order of presentation 
completely randomized. Open squares on the monitor were used as stimulus lights. Subjects responded 
by pressing the space bar as soon as a square was illuminated. The subject's forefinger rested lightly on 
the space bar, so that there was essentially no movement time involved. Previous research has shown 
that this "no movement" reaction time task is as effective as more traditional reaction time tasks involving 
movement (Kostas, Saccuzzo, Larson, 1987), ai\d has the advantage of minimizing errors that occur due 
to the necessity of pressing two keys in the movement time paradigm. In this procedure, the subject 
views the monitor on which there are one, three, or five line drawn squares. After a random period of 
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time from 1.5 to 2.5 seconds, one of the squares is illuminated. Reaction time is defined as the number 
of milliseconds between the onset of the stimulus (i.e., where one of the stimulus squares is illuminated) 
and the instant the subject presses the space bar. 

Coincidence Timing. The Coincidence Timing task was identical to that used by Larson (1989), 
based on the description provided by Smith and McPhee (1987). For each of three conditions, the 
subject's task was to press the space bar on the computer keyboard at the exact moment that a horizontally 
moving dot crossed a vertical line in the middle of the monitor. Condition 1 consisted of a dot that 
moved in a straight horizontal line at the speed of 0.10 meters per second, with random delays in the 
starting time. Condition 2 was identical to Condition 1 but at a speed of 0.15 meters per second. In 
Condition 3, the path of the dot was random (jagged), with a random delay and speed of 0.10 meters 
per second. The total distance traversed by the moving dot from orgin to crossing the line was 0.13 
meters. 

As in Larson (1989), there were 30 trials at each condition. Each trial consisted of a cycle in 
which the dot moved left to right across the screen, then right to left so that the dot crossed the centerline 
twice Finally, since skill in tasks such as coincidence timing may be related to the type of skills that 
children develop playing video games such as Nintendo (Salthouse & Prill, 1983), each of the 160 children 
in the study were asked to estimate the number of hours per week that they spend playing video games 
in a self-report procedure prior to implementation of the information-processing tasks. 

Mental Counters (MC). In the Mental Coimters(MC) task (Larson, 1986), subjects are asked to 
keep track of the values of three independent "counters" that change rapidly and in random order. The 
task requires subjects to simultaneously hold, revise, and store three counter values that change rapidly 
The counters themselves are represented as dashes on the video monitor (three side-by-side horizontal 
dashes in the center of the screen). The initial counter values are zero (0,0,0). When a small target (0.25 
inch, hoUow box) appears above one of the three dashes, the corresponding counter must be adjusted 
by adding one. When the target appears below one of the three dashes, the corresponding counter 
must be adjusted by subtracting one. The test items vary both in the number of targets and the rate of 
presentation. In the present study there were two different rates of presentation (0.633 seconds and 
1 42 seconds), and 8 targets, such that the values of the initial counters changed 8 times. Order of 
presentation of speeds was counterbalanced. Prior to the test proper, subjects were given instructions, 
examples, and practice to criterion (three consecutive correct responses). The maximum and minimum 
counter values varied between +3 and -3, respectively. The subject's task was to select, from among 
four choices, the correct list of final values for the three counters. Selection was made by pressmg the 
proper key on the keyboard. Feedback was given only during practice, and not during the test proper. 



Results 



Video Games. 

Data for self-reported hours per week spent playing video games such as Nintendo were 
evaluated in a 2(GATE: gifted vs nongifted) X 4(Ethnic Background) ANOVA. Tl^e only significant 
finding was a main effect for GATE, F(l,152) = 4.208, p < .042. TWs result revealed that the gifted 
children spent significantly less time per week (about half) playing video games than the nongifted 
children with means of 3.82 hours vs 6.04 hours, respectively. Thus, if prior practice at such tasks did 
make a difference, it would have been far in favor of the nongifted children, since they spent much 
more time playing video games. Notably, there were no ethnic differences in the number of hours 
spent playing these games. 
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Raven. 



Table 1 shows the Raven scores for gifted vs nongifted children as a function of ethnic 
background. The table shows the average Raven Z scores based on the U.S. smoothed norms provided 
by Raven et al (1986). These data were subjected to a 2(GATE) X 2(Grade) X 4(Ethnic Background) 
ANOVA. As might be expected, there was a significant main effect, F(l,135) = 42.69, p < .001, for Gifted 
(M = 1.11, SD = .92) vs Nongifted(M = .02, SD = 1.0) children. The only other significant finding was a 
GATE X Ethnic Background interaction, F(3,135) = 358, p < .05. Newman-Keuls post hoc multiple 
comparison tests of this difference revealed the gifted White, African- American, and Filipino children 
did not differ significantly among themselves and all three of these groups were significantly higher 
than the gifted Latino children and each of the four nongifted groups, none of whom differed significantly 
among diemselves. It should be noted, however, that of the 11 gifted children who did not receive 
Raven, 6 were in the Latino group. The absence of these children may have artificially lowered the 
overall mean for the Latino children. 

Table 1. 

Raven Z-Scores as a Function of Ethnic Background for Gifted versus Nongifted Children 



Latino /Hispaiuc White African- American Filipino 

Gifted Nongifted Gifted Nongifted Gifted Nongifted Gifted Nongifted 

M 0.51 0.36 1.36 0.01 1.34 -0.38 1.09 0.16 

SD 1.17 0.79 0.78 0.81 0.58 0.92 0.95 1.31 



For each of the four information-processsing variables, all F values, sigruf icance levels, means, 
and standard deviations are presented in Table 2 to conserve space. Whereas Grade reflects differences 
between 5th and 6th versus 2nd and 3rd graders, GATE reflects differences between Gifted versus 
Nongifted children, and Condition reflects the condition under study, such as Stimulus Ehiration, Choices, 
and Fast vs. Slow. 
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Table 2 

Summary of significant findings 



Significant Effects 


df 


F 


Level 


M 


SD 


Inspection Time 






Dtn " otn Larade 




1 1 Q 


Main Effect: Grade 


1/140 


36.73*** 


OO.i/ 






2nd - 3rd Grade 


57.50 


14.00 


Main Effect: GATE 


1/140 


7.84** 


Gifted 


63.78 


14.62 








Nongifted 






Main Effect: 


4/560 


80.12*** 


Duration 1 


52 


10.7 


Stimulus Duration 






Duration 2 


57 


13.4 








Duration 3 


0/ 


15.7 








Duration 4 


OO 


i/.Z 








Duration 5 


74 


18.3 


Grade X Inspection Time 


4/560 


8.73*** 








GATE X Inspection Time 


4/560 


3.76** 








Reaction Time (msec) 










90.55 


Main Effect: Grade 


1/135 


56.83*** 


5th - 6th Grade 


383 








- oTd Larade 




1 'VX A7 


Mam Effect: GATE 


1 /1 oc 

1/135 


O.Zi 




422 


113.75 








Nongifted 


467 


139.70 


Main Effect: Choices 


2/278 


141.99*** 


One Choice 


384 


117.71 






1 nree v-aioices 


454 


127.01 








Five Choices 




i^i no 




2/278 


4.17* 








GATE X Choices 


2/278 


5.08** 








GATE X Ethnicity 


3/139 


3.23* 








Coincidence Timing (Errors) 








18.34 


7.51 


Main Effect: Grade 


1/140 


36.27*** 


5th - 6th Grade 






2nd - 3rd Grade 


29.16 


1 7 ft7 
i/ .0/ 


Main Effect: GATE 


1/140 


19.95*** 


Gifted 


19.80 


8.89 






Nongifted 


27.73 


18.01 


Main Effect: Condition 


2/280 


94.76*** 


Slow, NonVariable 16.95 


11.39 






Fast, NonVariable 


25.18 


14 60 








Random 


29.32 


1 Q 01 

io.zi 


Grade X Condition 


2/280 


3.75* 








GATE X Condition 


2/280 


3.58* 








Mental Counters 








10.84 


4.11 


Mam Effect: Grade 


1 /in 
1/ llz 


AO no*** 


5th - 6th Grade 






2nd - 3rd Grade 


6.46 


3.76 


Main Effect: GATE 


1/112 


7.08** 


Gifted 


9.68 


4.34 






Nongifted 


8.34 


4.59 


Main Effect: Condition 


1/112 


66.00*** 


Slow 


10.17 


5.07 






Fast 


7.87 


3.95 


Grade X Condition 


1/112 


8.99** 








GATE X Condition 


1/112 


6.41** 
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* p < .05 
p<.01 
p< .001 
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Inspection Time (if). 



The data for IT were analyzed in a 2(Grade) X 2(GATE) X 4 (Ethnic Background) X 5 (Levels 
of Stimulus EHiration) mixed repeated measures ANOVA, with percent correct at each duration used 
as the dependent measure. There were significant main effects for Grade, GATE, and Stimulus 
EHiration (See Table 2). There were also two significcint interactions: Grade X Inspection Time, and 
GATE X Inspection Time. 



Figure 1. Inspection Time: Grade by stimulus duration interaction. 
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Figure 1 illustrates the Grade X 
Inspection Time interaction. As 
inspection of Figure 1 reveals, and as 
confirmed by post-hoc multiple 
comparison tests, the children in 
grades 5-6 had significantly better 
performance (p < .01) at each of the 
five level? of inspection time except 
the first/fastest speed, where both 
groups performed approximately at 
chance level. 
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Figure 2 illustrates the 
GATE X Inspection Time 
(IT) interaction. As this 
figure shows, and as 
confirmed by post-hoc 
multiple comparisons tests, 
the gifted children had 
significantly (p<.01) better 
performance at the two 
slowest speeds (IT4 and 
ITS). The groups did not 
differ significantly (p > .05) 
at the three fastest speeds, 
where the tendency of the 
younger children to 
perform at chance obscured 
differences between gifted 
and nongifted children at 
Speeds 2 (IT2) and 3(IT3), 
and all subjects were at 
chance at the fastest speed 
(ITl). 



Figure 2. Inspection Time: GATE by stimulus duration interaction. 
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Reaction Time. 



The median reaction time for each subject was analyzed in a 2(Grade) X 2(GATE) X 4(Ethnic 
Backgroimd) X 3(Choices) mixed repeated measures ANOVA. There were significant main effects for 
Grade, GATE, and Choices (See Table 2). There also were three significant interaction effects: Grade X 
Choices, GATE X Choices, and GATE X Ethnicity. 

Figure 3. Reaction Time: Grade by level of choice interaction. 



Figure 3 illustrates the 
Grade X Choices 
interaction. While the 
children in grades 5-6 
outperformed the children 
ii\ grades 2-3 at each level 
of choice, the difference 
between the groups 
increased as the number of 
choices increased. 
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Figure 4. Reaction Tmie: Gate by level of choice interaction. 
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Figure 4 illustrates the 
GATE X CHOICES 
interaction. The groups 
did not differ (p > .05) at the 
one choice condition, but 
were significantly diiferent 
at the 3 and 5 choice 
conditions (p < .01). 
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Figure 5. Reaction Time: GATE by ethnicity interaction for reaction time median. 
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Figure 5 illustrates the 
GATE X Ethnicity interaction. Post- 
hoc multiple comparison tests of this 
effect revealed a statistically 
significant difference between gifted 
African- Americans and the 
nongifted African-Americans. The 
other differences between gifted and 
nongifted groups for each ethnic 
backgroimd did not reach statistical 
sigruficance. 
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The individual variability of reaction time (RT Variance) was also analyzed in a 2(Grade) X 
2(GATE) X 4{Ethnic Background) X 3(Choices) mbced repeated ANOVA. For this analysis, the only 
significant finding was the main effect for Grade, F(l,138) = 11.02, p < .001. The GATE X Ethmc 
Background effect did not reach statistical significance (p > .081). 

Coincidence Timing (CT). 

Two dependent measures were used to evaluate the coincidence timing data: Coincidence 
Tunine Errors (CTE), which refers to the mean of the absolute value of the difference between the 
respoi^ position and the true position of the line; and Coincidence Tmiing Stai^dard Deviation ^D) 
wWch refers to the standard deviation of the distribution of response positions. The error (CTE) and 
Standard Deviation (CTSD) data were separately analyzed in a 2(Grade) X 2(GATE) X 4(Ethnic 
Background) X 3(Conditions) mixed repeated measures ANOVA. Resulfs were nearly identical for 
both dependent measures. For CTE there were significant main effects for Grade, GATE, and Condition 
(See Table 2). There were two significant interactions in the CTE data, the Grade X Condition, and the 
GATE X Condition. 



Figure 6. 



Grade by condition interaction (CTE). 



Figure 6 illustrates the Grade 
by Condition interaction for 
the Coincidence Timing 
errors. In this interaction, 
while the children in grades 
5-6 outperformed children in 
grades 2-3 at all levels, they 
did so at a greater rate for the 
variable condition (Condition 
3). 
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Figure 7. Gate by condition interaction (CTE). 



40r 



o 

2 
< 
w 
2 



30 



20 




Figure 7 illustrates the GATE X 
Condition interaction, and again 
shows the greatest difference 
occurring between the groups 
under the variable condition 
(Condition 3). 



GIFTED 
NONGIFTED 



1 



2 3 
CONDITION 



The analysis of standard deviations produced all tlie same main effects as weU as a GATE X 
Condition interaction. The only difference between the two measures was that the Grade X Condition 
interaction did not reach statistical significance for the standard deviation data. For both CT measures, 
there were no main or interaction effects involving ethnicity. 
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Mental Counters, 



For the Mental Counters Test, the number of correct responses was ai\alyzed in a 2(Grade) X 
2(GATE) X 4(Ethnic Background) X 2(Conditions - Fast and Slow) mixed repeated measures ANOVA. 
Results were parallel to those obtained with Inspection Tmte and Coincidence Timing. There were the 
familiar main effects for Grade, GATE, and Condition. As with the prior analyses, there were two 
significant interactions: Grade X Condition, and GATE X Condition. Figures 8 and 9 illustrate these 
interactions. 



Figure 8. Mental Coxmters: Grade by condition interaction. 
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In the Grade X Condition 
interaction, as shown in 
Figure 8, the children in 
grades 5-6 outperformed 
children in grades 2-3 at both 
conditions, but did so at a 
greater rate for the slow 
condition. 
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Figure 9. Mental Counters: Gate by condition interaction. 
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In the GATE by Condition 
interaction, as shown in 
Figure 9, the differences 
between the groups were 
significant (p < .01) only at 
the slow speed. 
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Oiher Analyses. 



Multiple regression analysis was used to determine the information-processing predictors of 
Raven IQ scores and of placement in the GATE program. First, the following variables were used to 
predict Raven scores in a stepwise multiple regression: the five levels of inspection time, (ITl, IT2...IT5); 
each of the three reaction time choices for botl\ variability and median RT; each of the three conditions 
of coincidence timing for both errors (CTE) and standard deviation (CTSD); and th e two levels of mental 
counters. Only one of these information-processing variables, IT4(Inspection Time, duration 4) entered 
into the equation and produced a significant F value, Fa,102) = 13.38, p < .0004. The multiple R between 
IT4 and the Raven was .34. Next, each of the above information-processing variables were used to 
predict GATE membership in a stepwise regression analysis. Three variables were significant in 
predicting GATE status: ITS, F(l, 118) = 8.45, p < .005, CTSD(condition 3), F2, 117 = 6.65, p < .002, and 
reaction time variance (5 choice condition), F(3, 116 = 6.13), p < .001. The multiple R between IT5 and 
GATE membership was .258. Adding in CTSD (condition 3) increased the multiple R to .319. With the 
addition of reaction time variance (5 choice condition), the multiple R increased to .370. 

Finally, an attempt was made to determine a possible cut-off score for the information-processing 
tasks to discriminate gifted from nongif ted children. Given that IT5 was the best predictor of GATE 
membership, we began with this variable. First, a discriminant analysis was conducted to determine 
the best score to discriminate the gifted vs the nongifted children. For IT5, this score was 73.5 percent 
correct. Next, frequency tables were constructed to determine the frequency of gifted vs nongifted 
children who scored above or below the cut-off. Twenty-one out of 77 gifted children (27 percent) 
scored below the cut-off; thirty-two out of 79 nongifted children (40 percent) scored above the cut-off. 
Results were similar for other infom^ation-processing tasks, which suggested that the use of these tasks 
to make individual decisions would indeed be hazardous. 



Discussion 



The present study adds to the literature in being the first to examine the information-processing 
abilities of children and the relationship between IQ and information-processing as a function of three 
major variables: grade (age), giftedness (as determined by an individual case study analysis by a school 
psychologist), and ethnic background. Whereas most of the relevant studies in this field are restricted 
to one or at most two measures of information-processing, the present study examined four different 
measures, two of which depended heavily on speed of processing (IT and RT) and two of which did not 
depend exclusively on speed. All information-processing tasks, however, can best be descnbed as 
elementary cogniHve tasks (ECTs), in that they are essentiaUy devoid of complex content and problem 
solving skills. 

AU of the tasks easily discriminated older children as a group versus younger chUdren. This 
finding is consistent with known developmental differences in choice reaction time and backward 
masking paradigms, and further shows that such differences can be extended to coincidence hmmg 
and mental counters, which is believed to reflect working memory capacity. 

The analyses of gifted versus nongifted differences in specific information-processing abilities 
yielded a number of critical interaction effects. For inspection time, gifted and nongifted children did 
not differ at the faster stimulus duraHons, where performance for both groups remained close to chance. 
As the stimulus duration increased, significant differences between the groups emerged. These 
differences are consistent with faster information-processing in the gifted children. In the reaction tune 
paradigm the differences between gifted and nongifted children occurred only for the three and five 
choice conditions, and were greater as the number of choices increased. Again, this significant interaction 
is consistent with faster processing (i.e., faster decision time) in the gifted group. Similarly, the gieatest 
difference between gifted and nongifted children in coincidence timing occurred for the most variable 
condition. In the mental counters task, the fast condition was too difficult for aU subjects; chance 
performance was the result for both groups. In the slower condition, however, the gifted children 
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cVarly outperformed the nongifted children. In sum, excluding simple reaction time and a level of 
difficulty so great that virtually all subjects were at chance, the gifted children showed a general 
superiority across all four elementary cognitive tasks. 

The superiority of the gifted children was essentially independent of ethnic background. There 
were no ethnic differences for inspection time, coincidence timing, and mental counters. Median reaction 
data did reveal a significant interaction between ethnic background and GATE membership. The gifted 
African- Americans had the fastest reaction times of all, whereas the nongifted African-Americans had 
the slowest. This result is of interest in that previous RT studies with African- Americans have studied 
only low or, at best, low-average to average IQ African- Americans. The general result has been slower 
choice RT's in the African-Americans compared to Whites. The present results show that among gifted 
African-Americans, reaction times are at least comparable, if not faster, than among other ethnic 
backgrounds. This finding warrants further investigation as it is suggestive of two different subgroups 
of African- Americar^ based on reaction time. 

Regression analj^ses were conducted to determine if the relationships between information- 
processing and IQ and between information-processing and GATE membership are due to speed of 
processing, or to a factor (or factors) other than speed. The results revealed that inspection time was the 
only significant predictor of Raven scores. This finding parallels that of Larson (1989), who used the 
same battery of information-processing tasks (except mental counters) on a group of 127 male Navy 
recruits. Larson found that inspection time was the only significant predictor of a measure of IQ based 
on the Armed Forces Qualifying Test (AFQT). 

Three information-processing variables predicted GATE membership: Inspection Time (ITS), 
Coincidence Tmiing (CTSD, Condition 3), and Reaction Time (5 Choice RT). The additional information- 
processing predictors of GATE membership may be attributable to the absence of 18 Raven scores. On 
the other hand, this result more likely reflects the multi-dimensional approach of the case study analysis 
used to select gifted children, as opposed to the use of a score on a single test. In either case, it is clear 
that the predominant underlying factor is speed. Both IT and RT involve speed of processing. Moreover, 
while Condition 3 of the CT task involves randomness, this task is also faster than Condition 1 of the CT 
task and equal in speed to CT2. It is reasonable to conclude that the addition of randomness in CT 
Condition 3 favored faster processors. In any case, speed of processing clearly was the predominant 
factor in the gifted-nongifted differences that were found in the information-processing tasks. 

An attempt to identify cut-off scores for the information-processing tasks failed to produce a 
reliable method of discriminating gifted vs nongifted children. Thus, while information-processing 
tasks may have relevance for our theoretical understanding of giftedness, there are no indications at 
present that such tasks could be used to make decisions about individuals. Much more work will be 
needed if such tests are to become practical. Moreover, the relationship between these elementary 
cognitive tasks and IQ or GATE membership remained low, with multiple correlations of .34 and .37, 
respectively These correlations are in line with others reported in the literature. Therefore, it would 
appear that speed of processing by itself is insufficient to account for giftedness or intelligence level, 
and that the use of a variety of elementary cognitive tasks adds littie. Thus, for a full accoimt of individual 
differences in intelligence in terms of ir\formation-processing, it would appear necessary to go beyond 
the elementary cognitive tasks and examine more complex information-processing skills as suggested 
by Sternberg (1981). In order to account for what is being measured by complex IQ tests, it appears 
necessary to examine the full range of information-processing skills. 
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Abstract 

The effect of ethnicity, gender, and risk on locus of control were 
investigated for gifted children and for high ability nongifted children. Eight 
hundred and five 5th through 7th grade African- American, Caucasians, Latino / 
Hispanic and Filipino children who had been referred for an evaluation of 
giftedness were evaluated in a case study analysis that included an evaluation 
of intellectual ability and the presence of one or more of sbc risk factors. Each 
child was given the Nowicki Strickland Locus of Control Scale for Children. A 
2(Gifted vs. NonGifted) X 4(Ethnic Background) X 4(Level of Risk) ANOVA 
revealed significant (p < .01) main effects for Giftedness, Ethnic Background, 
and Level of Risk, as well as a significant Ethnic X Risk Interaction (p < .038). In 
addition, a 2(Gender) X 2(Giftedness) X 4(Ethnic Background) ANOVA yielded 
a significant (p < .024) main effect for Gender. Overall, higher internal locus of 
control was associated with female Caucasians not at risk. However, in contrast 
to Caucasians, greater risk was associated with a higher internal locus of control 
in nonCaucasiai\s. Results confirm previous findings in showing a more internal 
locus of control in gifted children, and further indicate that for gifted and 
intellectually bright nonCaucasians, risk is associated with, and may serve to 
strengthen, a greater internal locus of control. Results support the use of locus 
of control as an alternative in identifying gifted traditionally underrepresented 
groups such as African- Americans and Latino/Hispanics. 
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Ethnic and Gender Differences in Locus of Control for Gifted and Nongif ted Children 

With and Without Risk 



Locus of control is a theoretical construct referring to the degree to which a person perceives a 
relationship between his or her own behaviors and the outcomes of those actions (Rotter, 1966). The 
most widely used instrument to measure locus of control is the Intemai-Extemal Scale developed by 
Rotter (1966). According to Rotter's theory, a person who assumes control and responsibility for the 
events in his or her life is said to have an internal locus of control. When a person attributes 
responsibility to outside sources, such as chance, luck or fate, he or she is said to have an external locus 
of control. Investigators have found that students who attribute their successes to their own efforts and 
abilities and their failures to a lack of effort will probably attempt a failed task again. Those students 
who attribute their successes to luck and their failures to lack of ability, however, are less likely to re- 
attempt a failed task (Payne & Payne, 1989; Weiner, 1977). 

In examining several locus of control studies. Rotter found strong support for the hypothesis 
that individuals who have a strong belief that they can control their own destinies are likely to a) be 
more aware of environmental factors that may influence future behavior; b) take steps to improve 
environmental conditions; c) place gTeater value on skill or achievement reinforcement; and d) be resistive 
to conformity and other subtle attempts to influence their behavior (see Rotter, 1966). In a review of the 
literature on the locus of control variable, Lefcourt (1976) found that individuals with axi internal rather 
than external locus of control were more perceptive, inquisitive and efficient in processing information. 

One of the main criticisms of Rotter's Intemai-Extemal Scale is that it is not suitable for children. 
Nowicki and Strickland have developed a measure to extend the investigation of the locus of control 
variable to children (Nowicki & Strickland, 1973). This measure has been used to examine several 
dimensions of child behavior. Empirical findings have revealed that locus of control becomes more 
intemal with age (Nowicki & Strickland, 1973; Brown et al., 1984). As children grow older, more is 
expected of them and they aie given an opportimity to succeed; thus, their perceptions of their ability to 
control their academic progress will become more intemal (Payne & Payne, 1989). Intemality has also 
been associated with higher socicd class and ethnicity, with middle class whites being most intemal 
(Nowicki & Strickland, 1973; Crandall, Katkovsky & Crandall, 1965). 

An extensive body of research has explored the relationship between locus of control and 
achievement. Nowicki and Strickland (1973) found a negative relationship between the locus of control 
and achievement of children in grades 3 through 12. As achievement scores went up, extemal scores 
went down; the children became more intemal. This was particularly tme for males. Gordon (1977) 
studied 113 fourth grade children and found that academic achievement, as measured by grades or 
achievement tests scores, could be predicted by knowing a child's locus of control score. Crandall, 
Katkovsky and Crandall (1965) measured locus of control with the lAR (Intellectual Achievement 
Response) Questionnaire, which is a measure aimed at assessing children's beliefs in reinforcement 
responsibility exclusively in intellectual academic situations. They reported tliat a higher intemality 
score on the lAR was positively correlated with at least two measures of academic achievement. 

Locus of control has been used to assess different populations. Fincham and Barling (1978), for 
example, found significant differences in locus of control between learning disabled, normal acliieving 
and gifted children, with leaming disabled being the most extemally oriented and gifted children the 
least extemally oriented. Gifted children have been identified as having a greater intemal locus of 
control than nongifted children (Delisle & RenzuUi, 1982; McClelland, Yewchuk, & Mulcahy, 1991). 
Laffoon, Jenkins-Friedman, and Tollefson (1989) looked at 137 achieving gifted, underachieving gifted 
and nongifted students in thiid, fourth and fifth grades. Their results indicated that gifted underachievers 
and nongifted students were significantly more extemal than achieving gifted students. They also 
found that the underachieving gifted and nongifted students made significantly more luck attributions 
for their failures. 
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Recently, an area of increasing interest has been in ethnic differences in locus of control (Chiu, 
1986). Hsieh, Shybut and Lotsof (1969), for example, studied Anglo- American, American Chinese and 
Hong Kong students. Their findings suggested that cultural orientation may be closely related to a 
personal belief in internal versus external control. Individuals raised in a culture that values 
independence, uniqueness, self-reliant individualism, and personal output of energy are likely to be 
more internally oriented than individuals from a culture that tends to emphasize a different set of 
values. Hsieh et al. (1969) also found Anglo-Americans to be more internally oriented than Chinese 
Americans who v/ere, in turn, more internally oriented than Hong Kong Chinese, suggesting the 
importance of the role that cultural context plays in the socialization process. In another study of 
culhire and locus of control. Battle and Rotter (1963) compared 80 black and white sbcth and eighth 
graders. These investigators foimd middle class whites to be the most internal and lower class blacks 
to be the most external. Similar findings have been shown in other studies (Nowicki & Strickland, 
1973; Gurin, Gurin, Lao & Beattie, 1%*^; Brown, Fulkerson, Fur, Ware & Voight, 1984). Studies of 
motivation and performance of black students suggest that blacks are less likely to hold strong beliefs 
of internal control and that social class and race probably interact so that lower class blacks stand out as 
externally oriented. 

In an attempt to explain differences in locus of control, Payne and Payne (1989) hypothesized 
that at-risk children will have a more external locus of control than children not considered at risk. The 
presence of certain external variables such as low socioeconomic status makes a child at risk. In a study 
of 643 black and white students classified as either at-risk or not-at-risk, Payne and Payne (1969) found 
that at-risk students had a significantly greater tendency to attribute their achievement and life 
experiences to external forces and influences. In contrast to other investigators, however, Payne and 
Payne found no main effect of locus of control for race. Browne and Rife (1991) also looked at locus of 
control orientation of at-risk and not-at-risk sixth grade students. They defined at-risk in tenns of the 
following variables: previous grade failures and retentions, attendance, prior disciplinary actions, family 
income, and number of parents in the family. They foimd that students at risk tended to have a more 
external locus of control. Studies of risk are limited, however, in that there is a lack of consistency in the 
definitions of at-risk children. In addition, previous studies have failed to consider the amount of risk 
and level of cognitive ability for each child. 

The literature shows that there is a positive relationship between locus of control and academic 
achievement. To date, however, no one has studied the effects of risk, ethnicity, cognitive ability and 
gender on the locus of control of children. The present study attempted to fill this gap in the literature. 
We examined the locus of control scores of children with no risk factors, one risk factor, two risk factors 
and three or more risk factors, as described below, to determine the effect of the presence of different 
levels of risk on locus of control. Furthermore, we compared gifted and nongifted children across 
several ethnic categories. Finally, we considered the role of gender. 



Method 

Subjects: 

The subjects consisted of 805 fifth through seventh grade students who had been referred for 
an evaluation of giftedness and whose parent provided informed consent to include the Locus of Control 
Scale as an experimental test whose results would not be used in determirung giftedness. Of these, 190 
were Latino /Hispanic, 341 Caucasian, 155 African-American, and 119 Filipino; 435 were female, 370 
male. 

Procedure: 

Each child was evaluated for giftedness in a case study evaluation by a school psychologist. 
The case study considered scores on tests of intelligence and achievement. In addition, each child was 
evaluated for the presence of risk factors. A risk factor was operationalized as the presence of one or 
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more of tl- .allowing six variables: environmental, economic, language, culture, social/emotional and 
health. Environmental risk included transiency (three or more changes in schools) and excessive absence 
from school because of home responsibilities such as child care duties or working to help support the 
family. Economic risk included parental unemployment or low income. Language risk included speaking 
English as a second language and lack of fluency in English. Cultural risk included cultural values and 
beliefs that differ from those of the dominant culture or limited experience in the dominant culture. 
Emotional risk encompassed such factors as death of a parent, child abuse, major psychiatric illness in 
the home, or extended absence of the parent because of military service. Health factors included vision, 
speech, and hearing deficits that required designateH ^iistructional service; motor problems that required 
adaptive physical education; or diseases that caused absences or hampered school progress, such as 
asthma or diabetes. Of the subjects, 85 had no risk factors, 114 had one risk factor, 236 had two risk 
factors, 370 had 3 or more risk factors. 

At the time of their evaluation for giftedness, each child was given the Nowicki Strickland 
Internal External Locus of Control Scale for Children (NSLOCS). The NSLOCS is a 40 item paper and 
pencU instrument in which the child answers "YES" or "NO" to such questions as "Do you beUeve that 
most problems will solve themselves if you just don't fool with them?" and "Do you beUeve that whether 
or not people like you depends on how you act?". A high score on the NSLOCS indicates extemaUty. 
Nowicki and Strickland have reported internal reliability coefficients ranging from .68 to .81 . The NSLOCS 
has also been found to correlate with other measures of locus of control such as the Rotter I-E Scale 
(Rotter, 1966) and the Bialer-Cromwell Scale (Biaier, 1961). 

Data on Locus of Control were not made avaUable to the school psychologist who conducted 
the giftedness evaluation. The psychologists used objective test results to determine giftedness. AU 
children who obtained an IQ score two standard deviations above the mean on a standardized test of 
intelligence such as the Wechsler Intelligence Scale for Qiildren-Revised (WISC-R) were automaticaUy 
certified for the gifted program. In addition, children with two or more risk factors who obtained an IQ 
score of 120 or greater were certified as gifted. For the purposes of the present analysis, only those 
children who scored two standard deviations above the mean on a standardized individual intelligence 
test were included as "gifted." Of the total sample, 364 met the criterion. Of these, 131 were Latino/ 
Hispanic, 147 were Caucasian, 24 were African-American, and 62 were Filipino. 



Results 



To examine risk factors and possible interactions of risk with giftedness and ethnicity, the data 
were analyzed in a 2(Giftedness: Gifted vs. NonGifted) X 4(Ethnic Background: Latino/Hispanic, 
Caucasian, African- American, Filipino) X 4(Level of Risk: 0, 1, 3, risk factors) ANOVA. There were 
significant main effects for Giftedness, F(i,804 )= 8.31, p < .004, Ethnic Background, F(3, 804) = 6-29, p < 
.001, and Level of Risk, F(3,804) = 2.84, p < .037. Also significant was the Ethnic X Risk Interacfaon, F(9, 
804)= 1.99, p<. 038. 

The main effect for Ethnic Background showed that the Caucasian children (M = 13.74, SD = 
4 57) had a significantly more internal locus of control than the Filipino children (M = 14.73, SD = 4.37), 
who, in turn, were significanUy more internal than Latino (M = 15.54, SD = 4.96) and African-American 
children (M = 15.90, SD = 4.40). 

The main effect for risk showed that children with two (M = 15.27, SD = 4.86) and three or more 
risk factors (M = 15.01, SD = 4.49) had a significanUy more external locus of control than children wiUi 
only one risk factor (M = 13.42, SD = 4.56) indicating that, summing across the other variables, the more 
risk factors a chUd has, the more likely he or she wiU be to attribute successes to external causes. 

Figure 1 illustrates the Ethnic Background by Level of Risk Interaction. For the Caucasians, 
having no risk factors was associated with a higher internal locus of control, and the children became 
more external as risk level increased. For the other three groups, with the exception of one data pomt 
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(Latino /Hispanic with one risk factor), the reverse was foxind. That is, for these nonCaucasians, greater 
risk was associated with a higher internal locus. 



Figure 1 . Locus of control as a function of risk and ethnic background 




LATINO/HISPANIC 
CAUCASIAN 
AFRICAN-AMERICAN 
FILIPINO 



NONE ONE TWO THREE OR MORE 
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To evaluate gender effects and possible interactions among gender, giftedness, and ethnic 
background, the data were evaluated in a 2(Gender) X 2(Giftedness) X 4(Ethnic Background) ANOVA. 
As with the previous ANOVA, there were significant main effects for Giftedness and Ethnic Background. 
The main effect for Gender was also significant, F(i^ 804) = 5.15, p < .024. TTiis effect was due to a higher 
internal locus in the females (M = 14.40, SD = 4.65) compared to the males (M = 15.23, SD = 4.67). 



Discussion 

The results revealed that, overall, gifted children have a more internal locus of control than 
nongif ted children. This result is not surprising, and is consistent with findings from Laffoon, Jenkins- 
Friedman, & Tollefson (1989). Laffoon et al, (1989) found that high achieving gifted students had a 
significantly higher internal locus of control than both underachieving gifted and nongifted students. 
Other studies have also found internal locus of control to be related to giftedness (Fincham & Barling, 
1987) and achievement (Keith et al., 1986; Harty et al, 1984). 

There were also ethnic differences in locus of control. The results of the present study revealed 
that Caucasian children have a more internal locus of control than Filipino, Utino, and African-American 
children. These findings are consistent with Hsieh et al. (1969), who found Anglo-American students 
to have a more internal locus of control than other ethnic groups. These findings suggest that cultural 
orientation may be closely linked to an individual's belief in external or internal control. As Hsieh et al. 
(1969) suggest, an individual who is raised in a culture that emphasizes independence, self-reliance, 
and individuation will typically develop a more internal locus of control than individuals who are 
^raised with a different set of values. This is an important finding considering the positive relationship 
between internal locus of control and achievement. If children, and specifically traditionally 



underrepresented or disadvantaged children, are miderachieving in school, one possible area on which 
to focus may be increasing the child's internal locus of control. The discrepancies in inteUigence and 
achievement scores may be due to differences in socialization processes as opposed to deficits in 
intellectual abilities. 

Level of risk also affected locus of control. Overall, children with two or more risk factors had 
a significantly more external locus of control than children with none or only one risk factor. Browne & 
Rife (1991) and Payne & Payne (1989) also found that at-risk students tended to ascribe their achievements 
and successes to outside, "external" causes. A child is generally considered at-risk if specific 
circumstances in the chUd's life affect his or her abiUty to perform at full potential. The child typicaUy 
has Uttle or no control over these variables. Language and cultural factors, for example, may influence 
a child's abiUty to perform in school, yet the child has no control over what culture into which he or she 
is bom. It foUows that the presence of multiple risk factors may lead the child to feel that he or she has 
little control over his or her envirorunent and thus, develop an external locus of control. 

Findings with level of risk must be considered in conjmtction with ethnic background, as there 
was a significant interaction between risk and ethnic background. This interaction indicated that for 
nonCaucasians the presence of risk factors was actually associated with a greater internal locus of 
control. Perhaps these children respond to adversity by becoming more internal. In any case, there 
was a clear association between risk and internal locus of control for gifted nonCaucasians. Whether 
these results generalize to children of average or below average ability, however, remains to be seen, 
and would seem unlikely based on previous studies (Brown & Rife, 1991; Payne & Payne, 1989). Thus, 
a high internal locus may serve as a marker for at risk gifted nonCaucasians. 

There were also gender effects in locus of control. Females showed a slight but significantly 
greater internal locus of control. These results are consistent with Young and Shorr (1986) and Cooper 
et al. (1981), who found that females tended to attribute both success and failure outcomes to internal 
causes sigi^cantly more often than males. This difference may be a result of differences in socialization 
of girls and boys. However, several other studies have failed to find any differences in locus of control 
between males and females (Payne & Payne, 1989; Browne & Rife, 1991; Brown et al., 1984). Furthermore, 
while the differences in locus of control for the present study were significant, in general all subjects m 
this gifted and high ability sample tended to be more internal. More research is needed in order to 
determine the extent of differences in locus of control for males and females of average or below average 
abilities. 

In general, our results further support the conclusion that internal locus of control is positively 
related to academic success, including identification for gifted programs. In addition, female shidents 
were found to be more internal than male students. Several factors have been identified that appear to 
influence an individual's locus of control. One factor that may lead to a more external locus of control 
is cultural orientation. The extent to which a culture beUeves in independence and self-reUance will 
possibly effect the degree to which a person perceives control over his or her successes and faUures. A 
second variable that appears to affect internal versus external locus of control is the number of risk 
factors attributed to the individual. It appears that the more risk factors a child has, the less mtemal 
control the child perceives. However, the effects of number of risk factors must be looked at m conjunction 
with ethnicity and IQ. The results of the present study suggest that for high IQ scoring nonCaucasians 
the presence of multiple risk factors may actuaUy serve to increase internal locus of control. Thus, locus 
of control has multiple determinants and interacts with risk and ethnicity. 

Future work in this area should be du-ected at examining, more specificaUy, how measures of 
locus of control might be used to produce equity in selecting diverse children for gifted programs. In 
the present shidy locus of control was used as an experimental tool. Scores on locus of control were not 
used for selection purposes. Cenamly such measures have shown promise and merit further 
investigation. 
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Abstract 



A well defined sample of gifted imderachievers was compared to a sample of 
gifted high-achievers. All children had full scale WISC-R IQ scores of 130 or greater. 
Underachievers were performing at or below the 50th percentile in at least one major area 
of achievement, whereas high-achievers were at the 96th percentile or greater in three 
areas of achievement: language, math, and reading. Of 6,067 children who had obtained 
full scalf* IQ scores of 130 or greater over a nine year period in an ethniccdly diverse 
population, 108 met criteria for gifted underachievement, and 96 met criteria for high 
achievement. Results of a 2 (achievement level) by 9 (WISC-R subtest) mixed repeated 
measures ANOVA revealed significant (p < .01) differences in scores on four verbal subtests: 
Information, Similarities, Vocabulary, and Comprehension. High-achievers had 
significantly (p < .001) higher Verbal, but not Performance, IQ scores than underachievers. 
However, comparison of the VIQ-PIQ discrepancy distributions for the children in the 
two groups revealed no significant differences. This finding negates the idea that a large 
VIQ-PIQ discrepeincy can be used as an indicator of risk for low achievement in gifted 
children, since large VIQ-PIQ discrepancies were as likely to be seen in high-achievers as 
in low. Analysis of gender, ethnicity, and risk revealed a greater concentration of 
nonCaucasian males with at least two risk factors in the underachieving group. Present 
findings are consistent with, and confirm those of others concerning the importance and 
discriminating power of the Information subtest in distinguishing high versus 
underachievers. The findings indicate that gifted underachievers are not as motivated or 
interested in acquiring traditional factual information as high-achievers. Creative teaching 
strategies are recommended to maximize the talents of underachievers. 
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Understanding Gifted Underachievers in an Ethnically Diverse Population 

Traditionally, gifted underachievers are defined as those children who cannot or will not perform 
at a level of academic achievement commensurate with their intellectual potential (Emerick, 1989; Fine, 
1967; Gowan, 1955). Underachievement manifests itself as a discrepancy between a child's performance 
in the classroom and his or her inteUectual abiUty (Rknm, 1988); a discrepancy between what is expected 
and what is actually accomplished (Newell, & d'Lberville, 1989). 

As Gowan (1955) noted some time ago, gifted underachievers represent one of the greatest 
social wastes in our culture. Gifted underachievers are children of exceptional ability who achieve at 
average or even below average levels. Unforhmately, gifted underachievers tend to be overlooked 
because such children perform at relatively good levels (Wolfe, 1991). Thus, while gifted children may 
be as susceptible to factors that cause underachievement as are children of normal intelligence, their 
underachievement is less likely to be recognized because of their giftedness (Supplee, 1989). 

According to various estimates, between 10% and 20% of high school dropouts are judged to 
have very superior ability (Lajoie & Shore, 1981; Nyquist, 1973; Whitmore, 1980). Ten to fifteen percent 
of the academically gifted are believed to achieve at a rate far below their potential (Gallagher, 1985; 
Ford, 1992). Consequentiy, an increasing number of researchers have called for more study of these 
gifted children who fail to fulfill their high potential and yet are so easily missed, or dismissed as lazy, 
manipulative or irresponsible (Emerick, 1989; Ford, 1992; GaUagher, 1985; Wolfle, 1991). 

Because they usually perform at satisfactory levels, gifted underachievers are difficult to identify, 
and consequently have proved resistant to systematic scientific inquiry. Much of the available research 
has been devoted to the problem of identification (Ford & Harris, 1990; Mather & UdaU, 1985). Some 
studies have looked at the role of the family (VanTassel-Baska, 1989) or socioemotional consequences 
(Cornell, Callahan, & Lloyd, 1991). Still other reports have used case studies to examine gifted 
underachieving children (Hannel, 1990), 

A major line of investigation has been concerned with test patterns in order to better understand 
and operationalize underlying correlates of underachievement in gifted children. The basis for the 
search for test patterns can be found, in part, in evidence that suggests that gifted children process 
information in a qualitatively different manner from average children on tests such as the WISC-R 
(Brown & Yakimowski, 1987). Moreover, in a study of WISC test patterns of bright and gifted 
underachievers. Bush and Mattson (1973) found that normal achievers and underachievers differed on 
three subtests: Information, Arithmetic, and Digit Span. In a related study, Moffitt and Silva (1987) 
examined children from an unselected birth cohort who had WISC-R Verbal and Performance IQ 
discrepancies that placed them beyond the 90th percentile. Underachieving children in this sample 
were foimd to have depressed a Verbal IQ relative to the Performance IQ. 

In the present study, we continued the exploration of the test patterns of gifted underachievers. 
A well-defined sample of gifted underachievers was compared to a vvellniefined sample of gifted high- 
achievers, to expUcate differences in patterns of subtest scores as weU as verbal-performance discrepancies. 



Method 

Subjects: , r j • i.u 

The subjects were drawn from children who were referred for an evaluation of giftedness m the 
San Diego School District between 1984 and 1993. The San Diego City School District consists of over 
123,000 children who attend more than 130 elementary, middle, and high schools across a wide geographic 
and ethnically diverse area. Children may be referred for a giftedness evaluation by teachers, parents, 
or central nomination from the District office. Each child referred is examined by a school psychologist 
who conducts a case study analysis including a consideration of IQ, achievement, aptitude, and nsk 
factors. For each child evaluated, the psychologist determines whether the child has one or more of five 



risk factors as follows: cultural/ language, economic, emotional, environmental, and health. Children 
who score two standard deviations above the mean on a standardized IQ test are automatically certified 
as gifted. Children may also be certified as gifted based on a combination of high achievement, high IQ 
and risk factors. 

Procedure: , , , ^ , nco/\ 

Between 1984 and 1991 the vast majority of children evaluated for giftedness (more than 95 h) 
were administered the Wechsler Intelligence Scale for Children-Revised (WISC-R). A total of 9,315 
children had been given the WISC-R during this time period. From these 9,315 chUdren, we identified 
all who had a FuU Scale IQ of 130 or greater (i.e., two standard deviations above the mean or greater). A 
total of 6,067 children met this criterion. From this group of high IQ children, a group of underachievers 
was obtained by selecting aU children who scored at the mean (i.e., 50th percentile) or lower on the Total 
Reading, Language, or Math scores of the Comprehensive Test of Basic Skills (CTBS). Such children 
would therefore have at least a two standard deviation discrepancy between IQ and achievement. A 
total of 108 children representing an ethnicaUy diverse sample met this criterion. To obtam a high- 
achieving group, aU children who had aU three achievement scores in the 96th percentile or higher were 
selected. A total of 96 children met this criterion. For the underachievers, 11 were Latino, 73 Caucasian, 
10 African-American, 7 Asian, and 7 Other (Native-Ameiican, Indochinese, Filipino, or Pacific Islander). 
For the high-achieving sample, 2 were Latino, 78 Caucasian, 4 African- American, 2 Asian, and 10 Other. 
Chi Square analysis was conducted to determine if there were significantly more nonCaucasians 
represented in the underachieving group. Compared to an expected 50-50 spUt, results revealed that 
there were significantly more nonCaucasians in the underachieving group x^(l, N = 108) = 5-45, (p < 
.02). There were no significant differences in numbers of Caucasians in the two groups (p > .05). In 
t-rms of risk factors, 56 high-achievers had none, 23 had one, and 17 had two. In the underachieving 
group 61 had no risk factors, 23 had one risk factor, and 24 had two risk factors. There were no statisticaUy 
significant differences between the two groups in the number of children in each risk category. In the 
high-achieving group, 49 were female, 47 male. In the underachieving ^oup, 40 were female and 68 
were male. Males were overrepresented in the underachieving group, x^(l, N = 108) = 7-26, p < .007. 



Results 



Table 1 shows the mean subtest performance for the nine subtests that were routinely given to 
the majority of the subjects. To evaluate differences in subtest performance between high and 
underachievers, the groups were compared in a 2 (Achievement Level) X 9 (Subtests) mixed repeated 
measures ANOVA. Hiere were significant main effects for Achievement, F(i, i3i) = 17.16, p < .001, and 
for Subtests F(8 i048) = 35.14, p < .001. Post-hoc multiple comparisons revealed that the high-achievers 
scored sigAific'antly higher (p < .01) on 4 subtests: Information, Similarities, Vocabulary, and 
Comprehension. 

WISC-R Means and Standard Deviations for Subtest Performance by High and Underachievers. 
WISC-R Subtest 



Information 
Similarities 
Arithmetic 
Vocabulary 
Comprehension 
Picture Completion 
Picture Arrangement 
Block Design 
Object Assembly 



High-achievers 
(M) (SD) 


Underachievers 
(M) (SD) 


Entire Sample 
(M) (SD) 


14.78 (2.09) 


13.88 


(1.87) 


14.30 (2.02) 


17.35 (1.99) 


16.65 


(1.97) 


16.97 (2.01) 


14.73 (2.33) 


14.40 


(2.15) 


14.55 (2.24) 


15.98 (2.08) 


15.40 


(2.08) 


15.67 (2.09) 


17.23 (1.76) 


16.13 


(1.97) 


16.69 (1.94) 


13.76 (2.44) 


13.90 


(2.14) 


13.83 (2.44) 


14.70 (2.47) 


14.67 


(2.51) 


14.68 (2.49) 


15.39 (2.53) 


14.82 


(2.51) 


15.09 (2.53) 


14.44 (2.86) 


14.01 


(2.73) 


14.21 (2.79) 
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Since all of the differences found were for Verbal subtests, high-achievers and underachievers 
were compared in a 2 (Achievement Level) X 2 (Verbal versus Performance IQ) ANOVA. PIQ scores for 
high-achievers (M = 132.2; SD = 9.7) and underachievers (M = 130.6; SD = 9.1) did not differ significantly. 
However, VIQ scores did differ significantly, F(i, 202) = 13.5, p < .001, with a mean of 137.8 (SD = 8.5) for 
the high-achievers and a mean of 133.4 (SD = 7.8) tor the underachievers. 

To investigate the possibility that high-achievers and underachievers differ in individual VIQ 
- PIQ discrepancy scores, VIQ - PIQ frequency distributions for the two groups were compared. No 
significant differences were found (Kolmogorov-Smimov Z = 1.007, p = .263). As can be seen in Figure 
1, relatively large VIQ - PIQ discrepancies were as likely to be seen in high-achievers as in low achievers. 



Figure 1 . Distribution of VIQ - PIQ differences at the extremes of achievement. 
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Performance was further analyzed through correlational analysis and stepwise multiple 
regression (see Table 2). Table 2 shows the intercorrelation matrix of the WISC-R subtests and CTBS 
(Language, Reading, and Math) scores. Stepwise multiple linear regression was performed, with the 
nine Wechsler subtest scores as predictors and level of achievement (i.e., whether the child was in the 
high versus underachievement group) as the criterion. Three subtests were significant in predicting 
achievement level. The first variable that entered into the equation was the Information subtest, with a 
multiple R of .29, F(i, 131) = 12.08, p < .001. The Comprehension subtest added significant variance, 
F(2, 30) = 10.1, p < .001, and increased the multiple R to .37. Finally, the Block I>esign subtest significantly, 
^(3, 129) = 8.5; p < .001, increased the multiple R to .41. 



Table 2. 

Correlation Matrix for WISC-R Subtests and Achievement Scores 





INFO 


COMP 


ARTTH 


SIMS 


VOCAB 


PC 


PA 


BD 


OA 


INFO 


1.00 


















COMP 


.22 


1.00 
















ARTTH 


.11 


.07 


1.00 














SIMS 


.07 


.18* 


.09 


1.00 












VCXIAB 


.40** 


.24** 


.11 


.29** 


1.00 










PC 


-.01 


-.11 


.02 


-.07 


.07 


1.00 








PA 


-.08 


-.11 


.02 


.11 


.06 


.12 


1.00 






BD 


.02 


-.16 


-.07 


-.12 


-.06 


.06 


.03 


1.00 




OA 


-.11 


-.07 


-.06 


.08 


.09 


.18* 


.36** 


.31** 


1.00 


CODING 


-.21* 


-.05 


.20* 


-.06 


-.03 


.01 


-.05 


-.07 


-.12 


CTBSL 


.26** 


.29** 


.05 


.10 


.15 


-.10 


-.04 


.09 


.10 


CTBSR 


.23** 


.23** 


.02 


.10 


.12 


.02 


.01 


.11 


.11 


CTBSM 


.05 


.17 


.07 


.12 


-.04 


-.04 


.01 


.16 


.03 



CODING CTBSL CTBSR CTBSM 



CODING 1.00 

CTBSL .08 1.00 

CTBSR .11 .7r* 1.00 

CTBSM .09 .52** .40** 1.00 



Note,: CTBSL = CTBS Total Language 
CTBSR = CTBS Total Reading 
CTBSM = CTBS Total Math 

* p < .05 
** p < .01 



To aid in understanding the gender and ethnic differences between high-achievers and 
underachievers, the groups were further compared in terms of gender, risk, and ethnicity simiiltaneously. 
Chi Square analysis revealed sigruficantly more male, nonCaucasians with 1 or more risk factors in the 
underachieving group: x^(l/ N = 33) = 5.50, p < .019. 
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Discussion 



The present sn'.dy compared intellectually gifted children who were achieving at least two 
standard deviations below expectation to a very high-achieving sample. In general, the high-achieving 
sample had sUghtly higher IQ scores. This superiority, however, was attributable only to Verbal subtests. 
The high-achievers had significantly higher scores on Information, Similarities, Vocabulary, and 
Comprehension. Although mean Verbal IQ was significantly higher for the high-achieving group, 
there were no differences for any of the Performance subtests or for the Performance IQ as a whole. 

Differences in the pattern of individual VIQ - PIQ discrepancy scores were plotted in frequency 
distributions for each group. No differences were found between the two distributions. Indeed, VIQ - 
i^IQ discrepancies on the order of 15 points or greater were found to be equally common in both high- 
achievers and underachievers. This finding underscores the faUacy of confusing statistical significance 
(i.e., the 15-point difference necessary to conclude with 95% certainty that an individual's VIQ and PIQ 
differ) with clinical significance (i.e., the mistaken conclusion that a 15-point VIQ-PIQ difference 
necessarily has prognostic significance and indicates risk for underachievement). Large VIQ-PIQ 
discrepancies are equaUy common in high- and low-achieving gifted children; only with the addition 
of a low achievement test score can a low-achiever be identified. In terms of predicting achievement 
level using WISC-R subtests, the primary correlates were Information and Comprehension, with Block 
Design adding a small, but significant, contribution to the variance. 

Analysis of gender, ethnicity, and risk further revealed a greater concentration of nonCaucasian 
males with at least two risk factors in the underachieving group. These findings are consistent with 
previous studies, which indicate that of the inteUectual underachievers, males outnumber females 
(GaUagher, 1985; Wolfle, 1991), and that many of these students are ethnic minorities (Ford, 1992). Thus, 
there is a need, as advocated by GaUagher (1985), to provide particular focus on underachieving minority 
males. 

Our results confirm the findings of Bush and Mattson (1973) concermng the importance and 
discriminating power of the Information subtest in distinguishing high-achievers versus underachievers. 
Present findings show that the older results with the WISC generalize to the WISC-R. The findings 
pertaining to the Information subtest, taken at face value, seem to suggest that gifted underachievers 
simply do not have as much interest or motivation for acquiring factual ir^formation as do high-achievers. 
This suggests that gifted underachievers may require creative teaching strategies, such as making 
information more relevant and interesting or channeling their abilities into more creative pursuits. 

Our findings are also consistent with those reported by Moffitt and Silva (1987). Gifted 
underachievers are characterized by certain depressed verbal skills; their Performance IQ's are 
comparable to that of the high-achievers. Thus, we can characterize the gifted underachiever as an 
individual who has not used his or her potential, or as CatteU (1963) would say, fluid inteUigence, to 
acquire a traditional body of knowledge (i.e., crystallized inteUigence). Again, the chaUenge for teachers 
is to find ways to motivate these tmderachievers to make fuU use of their potential. 

Our findings pertaining to gifted underachievers are also relevant to a previous report of the 
direction of the difference between Verbal versus Performance IQ in 4,546 gifted African-American, 
Caucasian, FUipino, and Hispanic chUdien (Saccuzzo, Johnson, & RusseU, 1992). This study showed 
that for the typicaUy gifted African- American, the Verbal IQ was actuaUy higher than the Perfonnance 
IQ For Hispanics, the Verbal and Performance IQ's were roughly equivalent. Thus, the relevant 
dimension includes both direction and size; a very high Performance IQ relative to Verbal IQ for an 
African-American and perhaps an Hispanic should signal the possibUity of a gifted underachiever 
because these individuals tend to have higher Verbal than Performance IQ's. For FUipmos, just the 
reverse is true since these individuals tend to have higher Performance than Verbal IQ (Saccuzzo et al, 
1992). Therefore, whUe the WISC-R may be biased in terms of selection Qohnson, 1992), it (or its relative, 
the WISC-m) may stUl have utility in identifying gifted underachieving African-American and Hispanic/ 
Latino students. 
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Beyond modification of our educational strategies, researchers have pointed to three major 
approaches to gifted underachievers. The first focuses on motivational factors (e.g., Boyd, 1990). 
According to this model there is a need to add excitement or relevance to the learning process in order 
to help gifted underachievers fulfill their potential A second approach emphasizes the importance of 
families as a source of encouragement and support for gifted underachievers (VanTassel-Baska, 1989). 
The third emphasizes the importance of personality variables, especially locus of control — one's 
perceived ability to influence or control the events of one's life (Laffoon, Jenkins-Friedman, & ToUefson, 
1989; Waldron, Saphire, & Rosenbaum, 1987; Willings & Greenwood, 1990). Certainly any one, two, or 
all three of these factors play a role in gifted imderachdevement and need to be considered in addressing 
tlie problems of each individual and unique student. 
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